-
Notifications
You must be signed in to change notification settings - Fork 194
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added transformer_text_generation #276
base: main
Are you sure you want to change the base?
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't see a Robustness Evaluation in the README.md. Per the organizers e-mail, can you run evaluate.py?
This type of augmentation might be risky to apply under some conditions so in the README it would be good to add "What are the limitations of this transformation?" section and explicitly add domain constraints, language constraints and so on. |
Potentially generation model could output top n predictions instead of one |
Concerning the robustness evaluation, do you you mean running code from https://github.com/GEM-benchmark/NL-Augmenter/tree/main/evaluation? |
that's correct, this is currently supported by our code by setting num_return_sequences parameter of generate method to n. |
Kaustubh sent an e-mail about the PR process. In it, it says "To evaluate your transformation, run the evaluate.py command and add the results of the models in your transformation's readme under the "Robustness Evaluation" section." That's all I am asking for - to add that section the README. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Please add your transformation name to the test/mapper.py in the right dictionary for the pytest to pick up your test.json. By default, we're testing only light transformations and filters.
Done! |
Hi |
No description provided.