This repository has been archived by the owner on Dec 12, 2019. It is now read-only.

Reproducing the result for STS-B #3

Open
cooelf opened this issue Mar 20, 2019 · 0 comments

cooelf commented Mar 20, 2019

Hi, thanks for your code! I am struggling to reproduce the results for STS-B: across many runs, my scores remain far below those reported in the BERT paper.
My best result so far (still low) and the corresponding hyper-parameters are as follows.

Pearson:  [dev] 90.42%    [test] 83.0%
Spearman: [dev] 90.04%    [test] 81.2%
learning rate: 2e-5
number of epochs: 5
max_seq_len: 128
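
For anyone comparing numbers, here is a minimal sketch of how the two metrics above are commonly computed from predicted and gold similarity scores. It is not taken from this repository's evaluation code. The function names and the sample scores are hypothetical and used only for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation: covariance divided by the product of the spreads."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = math.sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = math.sqrt(sum((y - mean_y) ** 2 for y in ys))
    return cov / (spread_x * spread_y)


def ranks(values):
    """1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of tied values.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def spearman(xs, ys):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(xs), ranks(ys))


# Hypothetical gold labels and model predictions (not from any real run).
gold = [0.0, 1.5, 3.0, 4.2, 5.0]
pred = [0.3, 1.0, 3.5, 3.9, 4.8]
print(f"Pearson:  {100 * pearson(pred, gold):.2f}%")
print(f"Spearman: {100 * spearman(pred, gold):.2f}%")
```

Spearman depends only on the ordering of the predictions, while Pearson also depends on how linearly they track the gold scores, so the two can diverge on the same outputs.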

Have you managed to reproduce the reported results? If so, could you share any hints or the settings you used?
