This code trains the models used for the Kaggle competition "Real or Not? NLP with Disaster Tweets":
https://www.kaggle.com/c/nlp-getting-started/overview/description
- Classical machine learning
- Topic-model output used as features for a supervised classifier
- Neural network (GloVe embeddings with an LSTM)
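The topic-modeling approach feeds per-document topic proportions into a classifier. A minimal sketch with scikit-learn is below; the toy tweets, the 2-topic setting, and the forest size are illustrative stand-ins (the actual run used 222 unigram topics on the full tweet corpus):

```python
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.decomposition import LatentDirichletAllocation
from sklearn.ensemble import RandomForestClassifier
from sklearn.pipeline import make_pipeline

# Tiny illustrative dataset (hypothetical tweets; the real data comes from train.csv)
texts = [
    "Forest fire near La Ronge Sask Canada",
    "Residents asked to shelter in place as wildfire spreads",
    "I love fruits and a sunny afternoon",
    "Summer is lovely and the park is quiet",
]
labels = [1, 1, 0, 0]  # 1 = real disaster, 0 = not

# Unigram counts -> LDA topic proportions -> supervised classifier.
# n_components=2 keeps the toy example fast; the repo's run used 222 topics.
pipe = make_pipeline(
    CountVectorizer(ngram_range=(1, 1)),
    LatentDirichletAllocation(n_components=2, random_state=0),
    RandomForestClassifier(n_estimators=10, random_state=0),
)
pipe.fit(texts, labels)
print(pipe.predict(["forest fire spreading fast"]))
```

Because LDA compresses each tweet into a short topic vector, the classifier sees far less lexical information than a bag-of-words model, which is consistent with this approach scoring lowest of the three.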
- Bernoulli Naive Bayes with lemmatization, OneHotEncoder, and CountVectorizer -> 78.92 %
- Random Forest classifier on 222 unigram topics -> 61.87 %
- GloVe with LSTM -> 79.20 %
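The Bernoulli Naive Bayes baseline can be sketched with scikit-learn as follows; the toy tweets are hypothetical, and the lemmatization step (e.g. via NLTK's WordNetLemmatizer) is omitted here to keep the sketch self-contained:

```python
from sklearn.pipeline import make_pipeline
from sklearn.feature_extraction.text import CountVectorizer
from sklearn.naive_bayes import BernoulliNB

# Tiny illustrative dataset (hypothetical tweets; the real data comes from train.csv)
texts = [
    "Forest fire near La Ronge Sask Canada",
    "Residents asked to shelter in place as wildfire spreads",
    "I love fruits and a sunny afternoon",
    "Summer is lovely and the park is quiet",
]
labels = [1, 1, 0, 0]  # 1 = real disaster, 0 = not

# binary=True turns counts into 0/1 presence indicators, matching BernoulliNB's
# Bernoulli feature assumption. The original pipeline also lemmatized tokens
# before vectorizing, which is left out of this sketch.
model = make_pipeline(CountVectorizer(binary=True), BernoulliNB())
model.fit(texts, labels)
print(model.predict(["fire near Canada"]))
```

BernoulliNB models each word as present/absent rather than counted, which suits short texts like tweets where most words appear at most once.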
By far, the approach that generates the best accuracy is the neural network using GloVe embeddings with an LSTM.
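The GloVe-with-LSTM architecture can be sketched in Keras as below. This is a minimal sketch, not the repo's exact code: a random matrix stands in for the pretrained GloVe vectors so the example runs without downloading an embeddings file, and the vocabulary size, sequence length, and LSTM width are illustrative assumptions:

```python
import numpy as np
from tensorflow.keras import Sequential, initializers
from tensorflow.keras.layers import Embedding, LSTM, Dense

vocab_size, embed_dim, max_len = 1000, 100, 30

# Stand-in for the real GloVe matrix: in the actual pipeline, row i would hold
# the pretrained 100-d GloVe vector for vocabulary word i (random here so the
# sketch runs without an embeddings download).
embedding_matrix = np.random.rand(vocab_size, embed_dim).astype("float32")

model = Sequential([
    Embedding(vocab_size, embed_dim,
              embeddings_initializer=initializers.Constant(embedding_matrix),
              trainable=False),       # frozen pretrained embeddings
    LSTM(64),                         # reads the tweet as a token sequence
    Dense(1, activation="sigmoid"),   # disaster vs. not
])
model.compile(optimizer="adam", loss="binary_crossentropy", metrics=["accuracy"])

# A forward pass on dummy token IDs shows the expected input/output shapes.
preds = model.predict(np.zeros((2, max_len), dtype="int32"), verbose=0)
print(preds.shape)
```

Freezing the embedding layer keeps the GloVe vectors intact while the LSTM learns how word order signals a real disaster, which is information the bag-of-words baselines discard.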