Commit

Add new version of the dataset
bryant1410 committed May 27, 2018
1 parent bd545e6 commit 3146889
Showing 14 changed files with 324,754 additions and 99,891 deletions.
158,231 changes: 117,801 additions & 40,430 deletions annotations.csv

Large diffs are not rendered by default.

47,378 changes: 27,283 additions & 20,095 deletions annotations_by_tweet.csv

Large diffs are not rendered by default.

27,283 changes: 27,283 additions & 0 deletions annotations_by_tweet_all.csv

Large diffs are not rendered by default.

25,201 changes: 25,201 additions & 0 deletions annotations_by_tweet_top.csv

Large diffs are not rendered by default.

14 changes: 13 additions & 1 deletion extraction/analyses.sql
@@ -139,7 +139,7 @@ FROM tweets t
GROUP BY session_id
HAVING COUNT(*) > 1000) top_sessions
GROUP BY t.tweet_id
-INTO OUTFILE '/var/lib/mysql-files/annotations_by_tweet_top2.csv' FIELDS TERMINATED BY ','
+INTO OUTFILE '/var/lib/mysql-files/annotations_by_tweet_top.csv' FIELDS TERMINATED BY ','
OPTIONALLY ENCLOSED BY '"'
ESCAPED BY '' LINES TERMINATED BY '\n';

@@ -206,3 +206,15 @@ FROM tweets
INTO OUTFILE '/var/lib/mysql-files/tweets.csv' FIELDS TERMINATED BY ','
OPTIONALLY ENCLOSED BY '"'
ESCAPED BY '' LINES TERMINATED BY '\n';
+-- Then add the CSV headers manually.
+
+SELECT
+tweet_id,
+session_id,
+date,
+vote
+FROM votes
+INTO OUTFILE '/var/lib/mysql-files/annotations.csv' FIELDS TERMINATED BY ','
+OPTIONALLY ENCLOSED BY'"'
+ESCAPED BY '' LINES TERMINATED BY '\n';
+-- Then add the CSV headers manually.
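
The "add the CSV headers manually" step in the comments could also be scripted. The following is a minimal sketch in Python, not part of this commit: `prepend_header` and `VOTES_COLUMNS` are hypothetical names, and the header names are assumed to match the columns in the `SELECT` list for `annotations.csv`. The published files may use different header names.

```python
import os
import tempfile
from pathlib import Path

# Assumption: header names mirror the SELECT list used for annotations.csv.
VOTES_COLUMNS = ["tweet_id", "session_id", "date", "vote"]


def prepend_header(csv_path, columns):
    """Insert a comma-separated header line at the top of an existing CSV file."""
    path = Path(csv_path)
    header = ",".join(columns) + "\n"
    # Write to a temporary file in the same directory and swap it in at the
    # end, so an interrupted run cannot leave a half-written CSV behind.
    with tempfile.NamedTemporaryFile(
        "w", dir=path.parent, delete=False, newline="", encoding="utf-8"
    ) as tmp:
        tmp.write(header)
        # Copy the exported rows through untouched; the file is not parsed,
        # so the quoting produced by INTO OUTFILE is preserved as-is.
        with path.open("r", newline="", encoding="utf-8") as src:
            for line in src:
                tmp.write(line)
    os.replace(tmp.name, path)
```

For example, `prepend_header("annotations.csv", VOTES_COLUMNS)` would be run once on a copy of the exported file. Running it twice would add the header twice, since the sketch does not check for an existing header.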
2 changes: 1 addition & 1 deletion index.html
@@ -109,7 +109,7 @@ <h2>Download</h2>
<h2>Roadmap</h2>

<ol>
-<li>Add more information to this website - ETA: Dec 2017</li>
+<li>Add and update information in this website (there's old stuff in here!) - no ETA yet</li>
<li>HUMOR dataset bundled in NLTK - no ETA yet</li>
</ol>

40,430 changes: 40,430 additions & 0 deletions previous/annotations.csv

Large diffs are not rendered by default.

20,095 changes: 20,095 additions & 0 deletions previous/annotations_by_tweet.csv

Large diffs are not rendered by default.

File renamed without changes.
File renamed without changes.
File renamed without changes.