
This is the data associated with the PERSUADE Corpus 2.0 version


bshahabi1/persuade_corpus_2.0

 
 


persuade_corpus_2.0

The PERSUADE 2.0 corpus builds on the PERSUADE 1.0 corpus by providing a holistic essay score for each persuasive essay in PERSUADE 1.0, as well as a proficiency score for each argumentative and discourse element annotated in the initial corpus. This version also contains all essays, whereas 1.0 was linked to the training set for the Kaggle competition.

In total, the PERSUADE 2.0 corpus comprises over 25,000 argumentative essays produced by 6th-12th grade students in the United States for 15 prompts on two writing tasks: independent and source-based writing. The corpus provides detailed individual and demographic information for each writer, as well as the initial annotations for the argumentative and discourse elements found in PERSUADE 1.0.

The .csv files are too large for GitHub. Links to the dataframes are below.

All the argumentative and discourse element annotations and effectiveness scores: https://drive.google.com/file/d/1rDy69X3sE7xtVgUQUsA6whlGANV1RgH3/view?usp=share_link

The holistic scores, demographic data, and individual differences: https://drive.google.com/file/d/10U558k6ocLeIRIwapDH-IqXjq0neK1R7/view?usp=share_link
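Because the two dataframes are hosted on Google Drive rather than in the repository, fetching them from a script takes one extra step. The sketch below pulls the file ID out of each share link and builds a direct-download URL. The `uc?export=download&id=` URL form is an assumption about how Google Drive serves files, not something this repository documents, and large files may still be routed through a confirmation page. The helper names `drive_file_id` and `direct_download_url` are hypothetical.

```python
import re

# Share links as given in this README.
ANNOTATIONS_LINK = (
    "https://drive.google.com/file/d/1rDy69X3sE7xtVgUQUsA6whlGANV1RgH3/view?usp=share_link"
)
SCORES_LINK = (
    "https://drive.google.com/file/d/10U558k6ocLeIRIwapDH-IqXjq0neK1R7/view?usp=share_link"
)


def drive_file_id(share_link: str) -> str:
    """Extract the file ID from a '/file/d/<id>/...' Google Drive share link."""
    match = re.search(r"/file/d/([A-Za-z0-9_-]+)", share_link)
    if match is None:
        raise ValueError(f"No Google Drive file ID found in: {share_link!r}")
    return match.group(1)


def direct_download_url(share_link: str) -> str:
    """Build a direct-download URL (assumed Drive URL form, see note above)."""
    return f"https://drive.google.com/uc?export=download&id={drive_file_id(share_link)}"


if __name__ == "__main__":
    for name, link in [("annotations", ANNOTATIONS_LINK), ("scores", SCORES_LINK)]:
        print(name, direct_download_url(link))
```

If the direct URL returns an HTML confirmation page instead of CSV data, downloading the files manually from the share links and reading the local copies is the safer route.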

A pre-print of the associated paper is at https://zenodo.org/record/8221504.
