
[Question] Stability of embeddings on consecutive runs? #264

Open
ml7 opened this issue Sep 1, 2022 · 3 comments

ml7 commented Sep 1, 2022

Hi there! I hope all is well. I noticed in the code that the embeddings seem to be initialized from a centered normal distribution (I originally thought torch.empty was being used), which naturally produces different results on each call, both in magnitude and orientation. We're noticing that embeddings trained on two separate runs (holding the data fixed) differ noticeably. I imagine it's probably due to a difference in rotation/translation, and we're wondering whether the initialization might be the cause.
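
For illustration, here's a minimal plain-PyTorch sketch (not the project's actual init code) of why unseeded normal initialization differs across runs, and how drawing through a seeded generator would make it repeatable:

```python
import torch

dim, n = 16, 100

# Two unseeded draws from a centered normal differ on every run.
a = torch.randn(n, dim)
b = torch.randn(n, dim)
print(torch.allclose(a, b))  # False

# Drawing through a seeded generator makes the init reproducible.
g1 = torch.Generator().manual_seed(0)
g2 = torch.Generator().manual_seed(0)
a = torch.randn(n, dim, generator=g1)
b = torch.randn(n, dim, generator=g2)
print(torch.allclose(a, b))  # True
```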

Would it also potentially be caused by the negative sampling not producing the same negatives? I did not see a generator/random seed in the negative sampling function. Any thoughts are appreciated!
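
To make that concrete, a hypothetical uniform negative sampler (illustrative only, not the repo's actual function) shows the issue: without an explicit generator it draws from torch's global RNG, so consecutive runs sample different negatives.

```python
import torch

num_entities, num_negatives = 1000, 5

# Hypothetical uniform negative sampler: with no explicit generator it
# uses torch's global RNG, so two training runs draw different negatives.
def sample_negatives(batch_size, generator=None):
    return torch.randint(
        num_entities, (batch_size, num_negatives), generator=generator
    )

print(sample_negatives(2))               # differs run to run
g = torch.Generator().manual_seed(42)
print(sample_negatives(2, generator=g))  # repeatable with a fixed seed
```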

@parsa-saadatpanah (Contributor)

There are a few random operations involved in the training process (including parameter initialization and negative sampling) that contribute to differences between trained embeddings.
Unfortunately, setting a random seed is currently not supported.
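
As a generic best-effort workaround (not something the library wires through; with multiple training processes or workers this alone may not yield determinism), one can seed the global RNGs before training:

```python
import random
import numpy as np
import torch

# Best-effort seeding of the global RNGs. In multi-process or
# multi-worker training this may not be sufficient on its own.
def seed_everything(seed: int) -> None:
    random.seed(seed)
    np.random.seed(seed)
    torch.manual_seed(seed)

seed_everything(0)
```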


ml7 (Author) commented Sep 12, 2022

Thank you @parsa-saadatpanah! Could you elaborate on the other random operations (aside from parameter initialization and negative sampling) that could contribute to different trained embeddings? That would help me figure out what we can adjust on our end. Thanks!

@parsa-saadatpanah (Contributor)

I believe the bucket scheduling and how each bucket is split into sub-buckets can also be random.
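
As a toy illustration of that point (not the project's actual scheduler): with partitioned entities, training iterates over (lhs, rhs) partition buckets, and an unseeded shuffle visits them in a different order each run.

```python
import random

# Toy illustration: enumerate the (lhs, rhs) partition buckets, then
# shuffle the visiting order, as a partitioned trainer might.
num_partitions = 4
buckets = [(i, j) for i in range(num_partitions) for j in range(num_partitions)]

random.shuffle(buckets)   # unseeded: order differs between runs
print(buckets[:4])

rng = random.Random(0)    # seeded: the same order every run
rng.shuffle(buckets)
print(buckets[:4])
```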
