Added the stopword removal transformation #268

juanyiloke · 2021-09-01T03:49:25Z

No description provided.

kaustubhdhole · 2021-09-03T22:37:12Z

transformations/stopword_removal/README.md

@@ -0,0 +1,14 @@
+# Stopword Removal
+Removes stopwords from a piece of text.


Hi @juanyiloke, please add your name, email, and affiliation.

done! @kaustubhdhole

timothy22000 · 2021-09-08T00:17:54Z

transformations/stopword_removal/test.json

+          "sentence": "OMG!!! jUSTin is AmAZEballs!!!"
+        },
+        "outputs": [{
+          "sentence": "OMG!!! jUSTin is AmAZEballs!!!"


Just curious, why isn't the stopword "is" removed in this particular example?

Good catch, fixed!
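
For reference, the expected behaviour for this test case can be sketched without NLTK. This is a minimal illustration, not the code from this PR: `STOPWORDS` is a small hand-picked set standing in for NLTK's English stopword list, and `remove_stopwords` is a hypothetical helper that splits on whitespace and compares tokens case-insensitively.

```python
# Hypothetical stand-in for NLTK's English stopword list (an assumption,
# not the list used by the PR).
STOPWORDS = {"a", "an", "the", "is", "are", "of", "to", "in"}


def remove_stopwords(text: str) -> str:
    """Drop whitespace-separated tokens that are stopwords (case-insensitive)."""
    kept = [tok for tok in text.split() if tok.lower() not in STOPWORDS]
    return " ".join(kept)


print(remove_stopwords("OMG!!! jUSTin is AmAZEballs!!!"))
# prints: OMG!!! jUSTin AmAZEballs!!!
```

Under these assumptions the output in test.json should no longer contain "is". A tokenizer that splits off punctuation would handle tokens such as "is," as well; the whitespace split here would leave them untouched.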

timothy22000 · 2021-09-14T23:44:40Z

transformations/stopword_removal/transformation.py

+
+
+def stopword_remove(text, max_outputs=1):
+    """


The max_outputs argument isn't used anywhere in the stopword_remove function?

aadesh11 · 2021-09-17T14:13:54Z

transformations/stopword_removal/transformation.py

+        super().__init__(seed, max_outputs=max_outputs)
+
+    def generate(self, raw_text: str):
+        pertubed_text = stopword_remove(


Minor cosmetic change: pertubed_text -> perturbed_text

Good catch, fixed!

aadesh11 · 2021-09-17T14:19:27Z

transformations/stopword_removal/transformation.py

+        TaskType.TEXT_TO_TEXT_GENERATION,
+    ]
+    languages = ["en"]
+    heavy = True


I think we can mark this transformation as light, i.e. heavy = False, since we are only using the nltk package.

timothy22000 · 2021-09-18T22:21:15Z

transformations/stopword_removal/transformation.py

+        TaskType.TEXT_TO_TEXT_GENERATION,
+    ]
+    languages = ["en"]
+    heavy = False


Need to add the keywords

timothy22000 · 2021-09-18T22:22:20Z

transformations/stopword_removal/README.md

+Author: Juan Yi Loke
+Email: [email protected]
+Affliation: University of Toronto
+


I believe you would need to add the Robustness Evaluation as per the instructions in the email :)

msobrevillac

I think the transformation is well implemented; however, it needs a well-motivated description to show its usefulness for the project.

kaustubhdhole · 2021-10-04T20:46:12Z

Okay, this transformation looks great. I do have a suggestion, though: removing all stopwords can be dangerous, and the Shakespeare sentence you added in the README is a clear example. It might be better to provide a parameter that controls how much change is permitted in a sentence, e.g. how many stopwords can be removed at a time. That way you might also be able to generate multiple sentences with little loss in meaning. Besides, please add appropriate keywords and a robustness evaluation to make this PR stronger. You might also want to mention any work on the influence of stopword removal (it might give better insights).
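
The suggestion above could look roughly like the following. This is only a sketch under stated assumptions, not the PR's implementation: `remove_some_stopwords`, `max_removals`, and the tiny `STOPWORDS` set are hypothetical names introduced for illustration, and the example sentence is not necessarily the one in the README.

```python
import random
from typing import List

# Hypothetical stand-in for NLTK's English stopword list (assumption).
STOPWORDS = {"a", "an", "the", "is", "are", "of", "to", "in",
             "or", "not", "be", "that"}


def remove_some_stopwords(
    text: str, max_removals: int = 1, max_outputs: int = 3, seed: int = 0
) -> List[str]:
    """Return up to max_outputs variants, each with at most max_removals
    stopwords dropped at randomly chosen positions."""
    rng = random.Random(seed)  # seeded so results are reproducible
    tokens = text.split()
    stop_idx = [i for i, tok in enumerate(tokens) if tok.lower() in STOPWORDS]
    if not stop_idx:
        return [text]  # nothing to remove
    k = max(0, min(max_removals, len(stop_idx)))
    outputs: List[str] = []
    # Bounded number of attempts so the loop ends even when only a few
    # distinct variants exist.
    for _ in range(10 * max_outputs):
        drop = set(rng.sample(stop_idx, k))
        candidate = " ".join(t for i, t in enumerate(tokens) if i not in drop)
        if candidate not in outputs:
            outputs.append(candidate)
        if len(outputs) == max_outputs:
            break
    return outputs


for variant in remove_some_stopwords(
    "To be or not to be, that is the question", max_removals=2
):
    print(variant)
```

Capping the number of removals keeps most of the sentence intact, and sampling different positions is one way to put `max_outputs` to use.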

aadesh11

Please also add your transformation name in test/mapper.py so that the test job can run your test cases.

juanyiloke · 2021-10-22T06:05:08Z

Thanks so much for the reviews everyone, I'll get to them this weekend. It's midterms season for me but that should be over soon.

juanyiloke and others added 10 commits August 31, 2021 23:46

Added stopword_removal transformation

fb63d68

added newline at eof

4f04626

Update transformation.py

6977b92

minor fix

541f10f

Merge branch 'stopword_removal' of https://github.com/juanyiloke/NL-A…

f90597b

…ugmenter into stopword_removal

Update test.json

0c98d2a

update test.json

980554e

Merge branch 'stopword_removal' of https://github.com/juanyiloke/NL-A…

795bc1b

…ugmenter into stopword_removal

should be g now

f7448c4

last one

1495063

kaustubhdhole added the transformation label Sep 3, 2021

kaustubhdhole reviewed Sep 3, 2021

added name, email, and affliation

059b27f

juanyiloke requested a review from kaustubhdhole September 3, 2021 23:07

timothy22000 reviewed Sep 14, 2021

aadesh11 reviewed Sep 17, 2021

juanyiloke added 3 commits September 18, 2021 15:50

fixed test.json

738e404

made changes based on comments

3d3e655

remove unneeded param

a6f7620

juanyiloke requested a review from aadesh11 September 18, 2021 20:00

timothy22000 reviewed Sep 18, 2021

msobrevillac reviewed Sep 19, 2021

JosephSefara mentioned this pull request Oct 5, 2021

Added Synonym insertion #160

Merged

aadesh11 requested changes Oct 22, 2021
