Yelp Restaurant Recommendation System

This recommendation system is trained on data from the Yelp Open Dataset.

1. The Data

The full dataset includes:

6,685,900 reviews for 192,609 businesses by 1,637,138 users

To train the recommendation system, this raw data was filtered down to reviews of restaurants only, and further to reviews by users who wrote 10 or more reviews:

2,295,089 reviews for 73,100 businesses by 81,416 users
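
The filtering step described above could be sketched with pandas roughly as follows. This is a minimal illustration, not the code used in the notebooks: the column names (`user_id`, `business_id`, `categories`), the rule that a restaurant is any business whose `categories` string mentions "Restaurants", and the choice to count reviews per user after the restaurant filter are all assumptions.

```python
import pandas as pd


def filter_reviews(reviews: pd.DataFrame, businesses: pd.DataFrame,
                   min_reviews: int = 10) -> pd.DataFrame:
    """Keep restaurant reviews written by users with at least `min_reviews` of them."""
    # Assumption: a business is a restaurant when its comma-separated
    # `categories` string mentions "Restaurants".
    is_restaurant = businesses["categories"].fillna("").str.contains("Restaurants")
    restaurant_ids = businesses.loc[is_restaurant, "business_id"]
    restaurant_reviews = reviews[reviews["business_id"].isin(restaurant_ids)]

    # Assumption: the review threshold is applied after the restaurant filter.
    counts = restaurant_reviews["user_id"].value_counts()
    active_users = counts[counts >= min_reviews].index
    return restaurant_reviews[restaurant_reviews["user_id"].isin(active_users)]
```

Calling `filter_reviews(reviews, businesses)` with the default threshold of 10 would correspond to the filter described here, provided the assumptions above match the actual preprocessing.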

2. Exploratory Data Analysis

Each distribution below is shown as two graphs: one for the complete dataset and one for the data used to train the model.

a. Distribution of Ratings

b. Distribution of Reviews per Business

c. Distribution of Reviews per User
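
The three distributions listed above can be computed from a review table with a few pandas aggregations. The sketch below assumes a DataFrame with `user_id`, `business_id`, and `stars` columns; the function name is hypothetical and the plotting itself is left out.

```python
import pandas as pd


def rating_distributions(reviews: pd.DataFrame) -> dict:
    """Return the three distributions as pandas Series."""
    return {
        # Number of reviews at each star rating.
        "ratings": reviews["stars"].value_counts().sort_index(),
        # For each business, how many reviews it received.
        "reviews_per_business": reviews.groupby("business_id").size(),
        # For each user, how many reviews they wrote.
        "reviews_per_user": reviews.groupby("user_id").size(),
    }
```

Running this once on the complete dataset and once on the filtered training data gives the pairs of series that the graphs compare.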

3. The Recommendation System

The recommendation system is built on the Singular Value Decomposition (SVD) model from the surprise library.

The base model (default hyperparameters) returned error metrics of RMSE = 1.0917 and MAE = 0.8587, and took ~20 minutes to train with 5-fold cross-validation.
After tuning hyperparameters over several iterations of GridSearch, the final error metrics were RMSE = 1.0780 and MAE = 0.8495, with training taking 48 seconds on 80% of the full dataset and testing done on the remaining 20%.
In other words, when the recommender predicts a rating between 1 and 5 stars, it is off by about 0.85 stars on average (the MAE).

Version 1 (02/07/2019):

Pick a user: the app shows the reviews that user has already written and makes the same number of recommendations, each with a Google Maps link to the recommended restaurant.
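
The behavior described above can be illustrated with a small sketch. It is not the app's actual code: the `recommend` and `maps_link` helpers, the dictionary shapes, and the use of a Google Maps search URL are assumptions made for illustration, and `predict` stands in for whatever the trained model exposes.

```python
from urllib.parse import urlencode


def maps_link(name: str, address: str) -> str:
    """Build a Google Maps search URL for a restaurant (assumed link format)."""
    query = urlencode({"api": 1, "query": f"{name} {address}"})
    return "https://www.google.com/maps/search/?" + query


def recommend(user_id, user_reviews, businesses, predict):
    """Return as many recommendations as the user has reviews.

    user_reviews: dict mapping business_id -> stars the user gave
    businesses:   dict mapping business_id -> (name, address)
    predict:      callable (user_id, business_id) -> estimated rating
    """
    # Only consider restaurants the user has not reviewed yet.
    candidates = [b for b in businesses if b not in user_reviews]
    # Highest predicted rating first.
    ranked = sorted(candidates, key=lambda b: predict(user_id, b), reverse=True)
    top = ranked[:len(user_reviews)]
    return [(businesses[b][0], maps_link(*businesses[b])) for b in top]
```

With a fitted surprise model, `predict` could be something like `lambda u, b: model.predict(u, b).est`.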

