
Strange behavior with randomness #8

Open

Cupcee opened this issue Jan 15, 2022 · 4 comments

Cupcee commented Jan 15, 2022

On a first run, e.g. right after cloning the repo and training with the command python3 refine_train.py --dataset ba3 --hid 50 --epoch 1 --ratio 0.4 --lr 1e-4, I always get an ACC-AUC of 0.518. On the second and all subsequent runs, the same command gives me an ACC-AUC of 0.490.

This happens with any number of epochs but is easiest to verify with just one. It seems like something is not quite working with the random seed on the first run (although the first run is still seeded, since it produces the same result of 0.518 every time), and once training has completed once, the seed starts working. I even re-cloned the repo several times, and the same pattern always occurred.
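
For reference, pinning down every source of randomness in PyTorch usually takes something like the sketch below. This is a generic snippet, not the seeding code this repo actually uses, so it may differ from what refine_train.py does:

```python
import random
import numpy as np
import torch

def set_seed(seed: int) -> None:
    """Seed every RNG source a typical PyTorch training script touches."""
    random.seed(seed)                 # Python's built-in RNG
    np.random.seed(seed)              # NumPy RNG
    torch.manual_seed(seed)           # PyTorch CPU RNG
    torch.cuda.manual_seed_all(seed)  # all CUDA devices, if any
    # Force deterministic cuDNN kernels; this can slow training down.
    torch.backends.cudnn.deterministic = True
    torch.backends.cudnn.benchmark = False

set_seed(0)
```

Even with all of the above, a run can differ if something consumes RNG state before training starts (e.g. one-time dataset preprocessing on a fresh clone), which would be consistent with the first-run-only discrepancy described here.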

I tried to fix this myself but couldn't. Now, this isn't really critical to fix or anything, but I think it's good to mention as it caused quite a bit of confusion for me when testing the code.

Torch version is 1.8.0 because I couldn't get 1.6.0 to work with torch-scatter. Otherwise the setup is the same as in the README.

Cupcee mentioned this issue Jan 15, 2022

smiles724 commented Jun 17, 2022

Me too. I could not reproduce the results on BA3: the ACC-AUC of ReFine is around 0.54-0.55, which is far below the number reported in the paper.

I agree with you that the performance is very dependent on the choice of random seed.

@smiles724

This is the result of my reproduction.
[screenshot: reproduced results]


Wuyxin (Owner) commented Jun 17, 2022

Note that running refine_train.py only completes the pre-training phase.

You can then take the pre-trained model through the fine-tuning phase by running evaluate.py to get the final results.

Also, did you re-train the GNN model? I found the results can also be sensitive to the particular GNN model, so it would be better to try a few different GNN models under different seeds.
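
A sweep over the full pipeline under several seeds might look like the sketch below. This is a guess at the interface: the --seed flag and evaluate.py's options are assumptions on my part, so check each script's argparse definitions before running it:

```python
# Hypothetical sweep over seeds: pre-train, then fine-tune/evaluate.
# The --seed flag and evaluate.py's flags are assumptions -- verify
# them against the argparse options in refine_train.py / evaluate.py.
import subprocess

for seed in (0, 1, 2):
    # Pre-training phase (flags taken from this thread; --epoch 1 was
    # the quick check used above, increase it for a real run).
    subprocess.run(
        ["python3", "refine_train.py", "--dataset", "ba3", "--hid", "50",
         "--epoch", "1", "--ratio", "0.4", "--lr", "1e-4",
         "--seed", str(seed)],  # assumed flag
        check=True,
    )
    # Fine-tuning + final evaluation phase for the final ACC-AUC.
    subprocess.run(
        ["python3", "evaluate.py", "--dataset", "ba3"],  # assumed flags
        check=True,
    )
```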

@smiles724

Yeah, I also found that the results are very sensitive to the GNN model. Previously I did retrain the model, but once I became aware of that, I switched to the original model weights that you provide. However, I am still struggling to get the expected results.

These are the results of running evaluate.py:
[screenshot: evaluate.py results]
