Project to score phrases within sentences according to sentiment
The file simple_inference_pipeline.ipynb is a first attempt at creating a baseline for this project using only heuristics and existing (pre-trained) models.
It is a Jupyter notebook that lets you input a multi-part sentence and see that sentence with positive and negative sentiment phrases color-coded green and red, respectively. Sentences are broken into phrases on conjunctions (and, but, or, etc.).
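The splitting and color-coding described above can be sketched roughly as follows. This is a minimal illustration, not the notebook's actual code: the conjunction list, the `TOY_LEXICON` scorer (a stand-in for a pre-trained sentiment model), and the function names are all assumptions made for this example.

```python
import re

# Assumption: a small set of conjunctions; the notebook's list may differ.
CONJUNCTIONS = ("and", "but", "or", "yet", "so")
SPLIT_RE = re.compile(
    r"\s*,?\s*\b(?:" + "|".join(CONJUNCTIONS) + r")\b\s*", re.IGNORECASE
)

# Hypothetical toy lexicon standing in for a pre-trained sentiment model.
TOY_LEXICON = {
    "great": 1.0, "love": 1.0, "good": 0.5,
    "bad": -0.5, "slow": -0.5, "terrible": -1.0,
}

# ANSI escape codes for green, red, and reset (for terminal output).
GREEN, RED, RESET = "\033[92m", "\033[91m", "\033[0m"


def split_on_conjunctions(sentence):
    """Break a sentence into phrases wherever a whole-word conjunction occurs."""
    parts = SPLIT_RE.split(sentence)
    return [p.strip(" .,!?") for p in parts if p.strip(" .,!?")]


def score_phrase(phrase):
    """Sum the toy lexicon scores of the words in a phrase (0.0 if none match)."""
    words = re.findall(r"[a-z']+", phrase.lower())
    return sum(TOY_LEXICON.get(w, 0.0) for w in words)


def colorize(sentence):
    """Return the phrases joined by ' | ', green if positive and red if negative."""
    rendered = []
    for phrase in split_on_conjunctions(sentence):
        score = score_phrase(phrase)
        if score > 0:
            rendered.append(f"{GREEN}{phrase}{RESET}")
        elif score < 0:
            rendered.append(f"{RED}{phrase}{RESET}")
        else:
            rendered.append(phrase)  # neutral phrases are left uncolored
    return " | ".join(rendered)


print(colorize("The food was great but the service was terrible"))
```

In a real pipeline, `score_phrase` would call the pre-trained model instead of a lexicon lookup; the splitting and rendering steps would stay the same.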
- Improve the existing pipeline by adding heuristics, beyond conjunctions, for breaking sentences into phrases.
- Add a training pipeline (with a custom model) using data from the Stanford NLP "Sentiment Treebank" dataset.
- Compare new pipeline to baseline pipeline and iterate on datasets, features, and models.
Use of conjunctions and other planned heuristics inspired by this paper:
Dataset found at:
https://nlp.stanford.edu/sentiment/index.html
Future potential resource: