emnlp-pragtag-2023

In order to reproduce the results given in the paper, you need to install all libraries in the requirements.txt in Python 3.8. Once all the libraries are installed, you can go through one of the following two routes -

Training and Inference -
1. For pre-training using MLM, execute "Domain Adaptation.py"
2. For fine-tuning sentence classification using labeled data after step 1, execute Classification.py
3. For fine-tuning sentence classification using labeled data without step 1, execute training_wo_mlm.py
4. To get the predictions after Step 2, execute inference_w_mlm.py
5. To get the predictions after Step 3, execute inference_wo_mlm.py
6. Use Word_Distribution_Analysis.ipynb to generate charts in the "PragTag 2023 - Vocabulary Analysis" paper.
7. To obtain performance of model fined tuned on model pre-trained with MLM on out of split data, execute inference_w_mlm_cv.py
8. To obtain performance of model fined tuned without pre-training on MLM on out of split data, execute inference_wo_mlm_cv.py
Inference -
1. To get the predictions from models trained after MLM pre-training, execute inference_w_mlm.py
2. To get the predictions from models trained without MLM pre-training, execute inference_wo_mlm.py
3. Use Word_Distribution_Analysis.ipynb to generate charts in the "PragTag 2023 - Vocabulary Analysis" paper.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
pdfs		pdfs
public_data		public_data
public_secret		public_secret
starting_kit/starting_kit		starting_kit/starting_kit
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE.txt		LICENSE.txt
NOTICE.txt		NOTICE.txt
README.md		README.md
Word_Distribution_Analysis.ipynb		Word_Distribution_Analysis.ipynb
classification.py		classification.py
domain_adaptation.ipynb		domain_adaptation.ipynb
domain_adaptation.py		domain_adaptation.py
domain_adaptation_warm_restart.py		domain_adaptation_warm_restart.py
eval.py		eval.py
gold_data_split_0.json		gold_data_split_0.json
gold_data_split_1.json		gold_data_split_1.json
gold_data_split_2.json		gold_data_split_2.json
gold_data_split_3.json		gold_data_split_3.json
gold_data_split_4.json		gold_data_split_4.json
gold_data_wo_split_split_0.json		gold_data_wo_split_split_0.json
gold_data_wo_split_split_1.json		gold_data_wo_split_split_1.json
gold_data_wo_split_split_2.json		gold_data_wo_split_split_2.json
gold_data_wo_split_split_3.json		gold_data_wo_split_split_3.json
gold_data_wo_split_split_4.json		gold_data_wo_split_split_4.json
inference_w_mlm.py		inference_w_mlm.py
inference_w_mlm_cv.py		inference_w_mlm_cv.py
inference_wo_mlm.py		inference_wo_mlm.py
inference_wo_mlm_cv.py		inference_wo_mlm_cv.py
load.py		load.py
pdfs.zip		pdfs.zip
predicted_split_0.json		predicted_split_0.json
predicted_split_1.json		predicted_split_1.json
predicted_split_2.json		predicted_split_2.json
predicted_split_3.json		predicted_split_3.json
predicted_split_4.json		predicted_split_4.json
predicted_wo_mlm_split_0.json		predicted_wo_mlm_split_0.json
predicted_wo_mlm_split_1.json		predicted_wo_mlm_split_1.json
predicted_wo_mlm_split_2.json		predicted_wo_mlm_split_2.json
predicted_wo_mlm_split_3.json		predicted_wo_mlm_split_3.json
predicted_wo_mlm_split_4.json		predicted_wo_mlm_split_4.json
requirements.txt		requirements.txt
scores_split_0.txt		scores_split_0.txt
scores_split_1.txt		scores_split_1.txt
scores_split_2.txt		scores_split_2.txt
scores_split_3.txt		scores_split_3.txt
scores_split_4.txt		scores_split_4.txt
scores_wo_mlm_split_0.txt		scores_wo_mlm_split_0.txt
scores_wo_mlm_split_1.txt		scores_wo_mlm_split_1.txt
scores_wo_mlm_split_2.txt		scores_wo_mlm_split_2.txt
scores_wo_mlm_split_3.txt		scores_wo_mlm_split_3.txt
scores_wo_mlm_split_4.txt		scores_wo_mlm_split_4.txt
starting_kit.zip		starting_kit.zip
training.ipynb		training.ipynb
training.py		training.py
training_wo_mlm.py		training_wo_mlm.py
update.sh		update.sh
utils.py		utils.py
vocabulary-overlap-0-case.pdf		vocabulary-overlap-0-case.pdf
vocabulary-overlap-0-diso.pdf		vocabulary-overlap-0-diso.pdf
vocabulary-overlap-0-iscb.pdf		vocabulary-overlap-0-iscb.pdf
vocabulary-overlap-0-scip.pdf		vocabulary-overlap-0-scip.pdf
vocabulary-overlap-1-case.pdf		vocabulary-overlap-1-case.pdf
vocabulary-overlap-1-diso.pdf		vocabulary-overlap-1-diso.pdf
vocabulary-overlap-1-iscb.pdf		vocabulary-overlap-1-iscb.pdf
vocabulary-overlap-1-rpkg.pdf		vocabulary-overlap-1-rpkg.pdf
vocabulary-overlap-1-scip.pdf		vocabulary-overlap-1-scip.pdf
vocabulary-overlap-2-case.pdf		vocabulary-overlap-2-case.pdf
vocabulary-overlap-2-diso.pdf		vocabulary-overlap-2-diso.pdf
vocabulary-overlap-2-rpkg.pdf		vocabulary-overlap-2-rpkg.pdf
vocabulary-overlap-2-scip.pdf		vocabulary-overlap-2-scip.pdf
vocabulary-overlap-3-case.pdf		vocabulary-overlap-3-case.pdf
vocabulary-overlap-3-diso.pdf		vocabulary-overlap-3-diso.pdf
vocabulary-overlap-3-iscb.pdf		vocabulary-overlap-3-iscb.pdf
vocabulary-overlap-3-rpkg.pdf		vocabulary-overlap-3-rpkg.pdf
vocabulary-overlap-3-scip.pdf		vocabulary-overlap-3-scip.pdf
vocabulary-overlap-4-case.pdf		vocabulary-overlap-4-case.pdf
vocabulary-overlap-4-diso.pdf		vocabulary-overlap-4-diso.pdf
vocabulary-overlap-4-iscb.pdf		vocabulary-overlap-4-iscb.pdf
vocabulary-overlap-4-rpkg.pdf		vocabulary-overlap-4-rpkg.pdf
vocabulary-overlap-4-scip.pdf		vocabulary-overlap-4-scip.pdf
vocabulary-overlap-secret-case.pdf		vocabulary-overlap-secret-case.pdf
vocabulary-overlap-secret-diso.pdf		vocabulary-overlap-secret-diso.pdf
vocabulary-overlap-secret-iscb.pdf		vocabulary-overlap-secret-iscb.pdf
vocabulary-overlap-secret-rpkg.pdf		vocabulary-overlap-secret-rpkg.pdf
vocabulary-overlap-secret-scip.pdf		vocabulary-overlap-secret-scip.pdf
vocabulary-overlap-secret-secret.pdf		vocabulary-overlap-secret-secret.pdf
vocabulary-overlap-test-case.pdf		vocabulary-overlap-test-case.pdf
vocabulary-overlap-test-diso.pdf		vocabulary-overlap-test-diso.pdf
vocabulary-overlap-test-iscb.pdf		vocabulary-overlap-test-iscb.pdf
vocabulary-overlap-test-rpkg.pdf		vocabulary-overlap-test-rpkg.pdf
vocabulary-overlap-test-scip.pdf		vocabulary-overlap-test-scip.pdf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

emnlp-pragtag-2023

About

Releases

Packages

Languages

License

suri-kunal/emnlp-pragtag-2023

Folders and files

Latest commit

History

Repository files navigation

emnlp-pragtag-2023

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages