Sentiment Based Product Recommendation

Problem Statement

E-commerce is now a well-established way to sell. Instead of taking orders from each customer in person, a company launches a website through which it sells directly to end consumers, and customers order the products they need from that site. Well-known examples of e-commerce companies are Amazon, Flipkart, Myntra, Paytm and Snapdeal.

Suppose you are working as a Machine Learning Engineer at an e-commerce company named 'Ebuss'. Ebuss has captured a large market share in many fields, and it sells products in categories such as household essentials, books, personal care products, medicines, cosmetic items, beauty products, electrical appliances, kitchen and dining products, and health care products.

As technology advances, Ebuss must grow quickly to become a major player in the e-commerce market, because it competes with established market leaders such as Amazon and Flipkart.

As a senior ML Engineer, you are asked to build a model that improves the recommendations shown to users, based on their past reviews and ratings.

To do this, you plan to build a sentiment-based product recommendation system, which involves the following tasks:

Data sourcing and sentiment analysis
Building a recommendation system
Improving the recommendations using the sentiment analysis model
Deploying the end-to-end project with a user interface

Solution

GitHub link: https://github.com/PravinAmin/sbprs-capstone
Heroku (Application is Live): https://ebuss-sbprs-app.herokuapp.com/

Built with

Python 3.9.7
scikit-learn 1.0.2
xgboost 1.5.1
numpy 1.22.0
nltk 3.6.7
pandas 1.3.5
Flask 2.0.2
Bootstrap CDN 5.1.3

Solution Approach

The dataset and attribute descriptions are available under the "datasource" folder.
The dataset is loaded, explored (EDA), cleaned and visualized, and the review text is pre-processed.
A TF-IDF vectorizer is used to vectorize the textual data (review_title and review_text).
EDA shows that the dataset suffers from class imbalance, which is handled with SMOTE oversampling before the models are trained.
Machine learning classification models (Logistic Regression, Naive Bayes, and the tree-based algorithms Decision Tree, Random Forest and XGBoost) are trained on the vectorized data with user_sentiment as the target column. The objective is to classify each review's sentiment as positive (1) or negative (0). The models are compared on standard classification metrics (accuracy, precision, recall, F1 score, AUC), and XGBoost is selected as the best model on these metrics.
A collaborative filtering recommender system is built using both user-user and item-item approaches, and RMSE is used as the evaluation metric.
The sbprs.ipynb Jupyter notebook contains the code for sentiment classification and the recommender systems.
The better of the two recommender systems is used to pick the top 20 products. For each of these products, user_sentiment is predicted for all of its reviews, and the top 5 products with the highest positive user sentiment are kept (model.py).
The machine learning models are saved as pickle files (under the "pickle" folder). A Flask API (app.py) is used to interface with and test the models, and Bootstrap with Flask Jinja templates (templates\index.html) is used for the user interface.
The end-to-end application is deployed on Heroku.
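
As an illustration of the user-user collaborative filtering step and its RMSE evaluation, here is a minimal sketch in plain Python. It is not the notebook's code: the toy `ratings` dictionary, the helper names (`mean_center`, `cosine`, `predict_rating`, `rmse`) and the choice of mean-centered cosine similarity with only positively similar neighbors are all assumptions made for the example.

```python
from math import sqrt

# Hypothetical toy data: user -> {product: rating on a 1-5 scale}.
ratings = {
    "alice": {"soap": 5, "shampoo": 4, "lotion": 1},
    "bob": {"soap": 4, "shampoo": 5, "toaster": 2},
    "carol": {"soap": 1, "lotion": 5, "toaster": 4},
}


def mean_center(user_ratings):
    """Subtract the user's own mean rating to remove rating-scale bias."""
    mean = sum(user_ratings.values()) / len(user_ratings)
    return {product: r - mean for product, r in user_ratings.items()}


def cosine(a, b):
    """Cosine similarity of two sparse vectors (missing entries count as 0)."""
    dot = sum(a[p] * b[p] for p in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def predict_rating(ratings, user, product):
    """Similarity-weighted average of neighbors' ratings for `product`.

    Assumption for this sketch: neighbors with non-positive similarity
    are ignored. Returns None when no usable neighbor rated the product.
    """
    centered = {u: mean_center(r) for u, r in ratings.items()}
    weighted_sum = 0.0
    weight_total = 0.0
    for other, other_ratings in ratings.items():
        if other == user or product not in other_ratings:
            continue
        sim = cosine(centered[user], centered[other])
        if sim <= 0:
            continue
        weighted_sum += sim * other_ratings[product]
        weight_total += sim
    return weighted_sum / weight_total if weight_total else None


def rmse(pairs):
    """Root mean squared error over (actual, predicted) rating pairs."""
    return sqrt(sum((actual - pred) ** 2 for actual, pred in pairs) / len(pairs))


predicted = predict_rating(ratings, "alice", "toaster")
print(predicted)
```

With this toy data, Alice's only positively similar neighbor is Bob, so her predicted rating for the toaster equals Bob's rating of 2. An item-item variant would apply the same similarity to product columns instead of user rows, and `rmse` can then be used to compare the two on held-out ratings.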
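
The final filtering step (top 20 recommended products narrowed to the top 5 by predicted sentiment) can be sketched as follows. This is a hedged illustration rather than the contents of model.py: the function name `top_products_by_sentiment`, the toy data, and the use of the share of positive reviews as the ranking score are assumptions, and `toy_sentiment` is a keyword-based stand-in for the pickled classifier and TF-IDF vectorizer.

```python
def top_products_by_sentiment(candidates, reviews_by_product, predict_sentiment, k=5):
    """Re-rank recommended products by their share of positive reviews.

    candidates: product names from the recommender (e.g. its top 20).
    reviews_by_product: product name -> list of review texts.
    predict_sentiment: callable returning 1 (positive) or 0 (negative).
    Assumption for this sketch: products without reviews are skipped.
    """
    scored = []
    for product in candidates:
        reviews = reviews_by_product.get(product, [])
        if not reviews:
            continue
        positives = sum(predict_sentiment(text) for text in reviews)
        scored.append((positives / len(reviews), product))
    # Stable sort: ties keep the recommender's original order.
    scored.sort(key=lambda item: item[0], reverse=True)
    return [product for _, product in scored[:k]]


def toy_sentiment(text):
    """Hypothetical stand-in for the trained sentiment model."""
    lowered = text.lower()
    return 1 if "great" in lowered or "love" in lowered else 0


# Hypothetical recommender output and reviews.
candidates = ["soap", "shampoo", "lotion"]
reviews_by_product = {
    "soap": ["Great soap", "Bad smell"],
    "shampoo": ["Great", "Love it, great value"],
    "lotion": ["Bad", "Awful texture"],
}

top = top_products_by_sentiment(candidates, reviews_by_product, toy_sentiment, k=2)
print(top)
```

In the deployed application, `predict_sentiment` would presumably wrap the saved vectorizer and classifier, so that each review is vectorized before prediction.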

