The PERSUADE 2.0 corpus builds on the PERSUADE 1.0 corpus by providing holistic essay scores for each persuasive essay in the PERSUADE 1.0 corpus as well as proficiency scores for each argumentative and discourse element found in the initial corpus. This version also contains all essays (as compared to 1.0, which linked only the training set for the Kaggle competition).
In total, the PERSUADE 2.0 corpus comprises over 25,000 argumentative essays produced by 6th-12th grade students in the United States for 15 prompts on two writing tasks: independent and source-based writing. The PERSUADE 2.0 corpus provides detailed individual and demographic information for each writer as well as the initial annotations for the argumentative and discourse elements found in PERSUADE 1.0.
The .csv files are too large for GitHub. The links for the dataframes are below.
All the argumentative and discourse element annotations and effectiveness scores: https://drive.google.com/file/d/1rDy69X3sE7xtVgUQUsA6whlGANV1RgH3/view?usp=share_link
The holistic scores, demographic data, and individual differences: https://drive.google.com/file/d/10U558k6ocLeIRIwapDH-IqXjq0neK1R7/view?usp=share_link
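Once both files are downloaded, the element-level annotations can be combined with the essay-level scores. The following is a minimal sketch using only the Python standard library, not an official loader. The local file names and the `essay_id` key column are assumptions made for illustration; check the actual headers in the downloaded .csv files before relying on them.

```python
import csv


def read_rows(path):
    """Read a CSV file into a list of dicts, one dict per row."""
    with open(path, newline="", encoding="utf-8") as handle:
        return list(csv.DictReader(handle))


def attach_essay_info(element_rows, essay_rows, key="essay_id"):
    """Attach essay-level fields to every element-level row with the same key.

    `key` is a hypothetical column name; replace it with the identifier
    column that both downloaded files actually share.
    """
    essays = {row[key]: row for row in essay_rows}
    merged = []
    for row in element_rows:
        # Rows without a matching essay keep only their own fields.
        essay = essays.get(row[key], {})
        # If both files share a column name, the element-level value wins.
        merged.append({**essay, **row})
    return merged


# Example usage (hypothetical local file names):
# elements = read_rows("persuade_elements.csv")
# essays = read_rows("persuade_essays.csv")
# combined = attach_essay_info(elements, essays)
```

Each row of the result describes one annotated element together with the scores and demographic fields of the essay it belongs to. For files of this size, a dataframe library may be more convenient, but the join logic is the same.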
A pre-print of the associated paper is at https://zenodo.org/record/8221504.