Skip to content

Python implementation of Personalize Expedia Hotel Searches - ICDM 2013

Notifications You must be signed in to change notification settings

igorpejic/personalize_expedia_hotel_searches_2013

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

This is an impelementation of a model which achieves 4th (NDCG@38=0.534) place on: Personalize Expedia Hotel Searches - ICDM 2013 challenge https://www.kaggle.com/c/expedia-personalized-sort/overview

First, you need to install the packages:

pip3 install -r requirements.txt

Afterwards you can run the training and generation of the submission file like this:

python3 run.py /home/i/data/train.csv /home/i/data/test.csv /home/i/results_directory/

The results (in kaggle submission format) will be saved to submission.csv and the trained model to model.dat

The process of training will take long (on VM with 32 vCPU it took 2.5 hours with the default parameters). To make training faster, but less accurate change 'dart' to 'goss' and decrease n_estimators.

It is recommended to have 8GB RAM free otherwise you might run into memory issues.

This project was done in the scope of the course Data Mining Techniques taught at Vrije Universitet, Amsterdam in 2019. Special thanks to team members @P4ppenheimer and @ijanerik.

About

Python implementation of Personalize Expedia Hotel Searches - ICDM 2013

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages