Reddit-to-Dialogue is a tool that transforms a Reddit post and its comments into a dialogue.
This project began at Emory University in the Emory NLP Lab under the direction of Dr. Jinho Choi.
Development spanned two separate undergraduate honors theses, by Daniil Huryn and Mack Hutsell, and resulted in a long paper accepted to COLING 2022.
!! Pip Package Coming Soon !! `pip3 install reddit-to-dialogue`

Data should be in JSON format, organized as defined by PRAW. An example is in the reddit folder.
Dialogues are returned in the following format (an example is in exampleoutput):
```
[{
    "sid": "",
    "link": "",
    "title:": "",
    "text": "",
    "author": "",
    "created": unix timestamp,
    "updated": unix timestamp,
    "over_18": boolean,
    "upvotes": integer,
    "upvote_ratio": decimal value 0 - 1,
    "response": [
        "",
    ],
    "dialogue": [
        "",
    ],
    "score":
}, ]
```
Here, `response` is a list of Speaker 2 statements, and `dialogue` alternates between Speaker 1 and Speaker 2 statements.
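
As a rough illustration of consuming this output, the sketch below labels each entry of a `dialogue` list with its speaker. It assumes that Speaker 1 always opens the dialogue and that turns strictly alternate. The helper names `load_dialogues` and `label_turns` and the sample record are hypothetical and are not part of the tool.

```python
import json


def load_dialogues(path):
    """Read a list of dialogue records from a JSON file (hypothetical helper)."""
    with open(path, encoding="utf-8") as f:
        return json.load(f)


def label_turns(dialogue):
    """Pair each statement with a speaker label.

    Assumption: Speaker 1 speaks first and the two speakers strictly alternate.
    """
    return [
        (f"Speaker {index % 2 + 1}", statement)
        for index, statement in enumerate(dialogue)
    ]


# Made-up record for illustration only; real records carry the other fields too.
record = {
    "dialogue": [
        "What laptop should I buy for school?",
        "What is your budget?",
        "Around 800 dollars.",
    ],
}

for speaker, statement in label_turns(record["dialogue"]):
    print(f"{speaker}: {statement}")
```

To run this on real output, pass each record returned by `load_dialogues` through `label_turns` in the same way.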