4th place solution to the TBrain competition - E.SUN AI Open Competition Summer 2019 - House Price Prediction (玉山人工智慧公開挑戰賽2019夏季賽 - 台灣不動產AI神預測)
- Chen-Hsi (Sky) Huang (github.com/skyhuang1208)
- Louis Yang (github.com/louis925)
We won 4th place out of 766 teams (top 0.5%) with private leaderboard score 6210.877.
dataset
- place input datasettrain.csv
andtest.csv
gen_5_fold_cv.ipynb
- generate 5-fold CV dataset
eda-and-exp/
- contains notebooks for exploratory data analysis and various experimentseda*
- exploratory data analysisexp*
- experiments
model-<model_number>-build-<model_name>.ipynb/.py
- parameters search with small training step for single modelmodel-<model_number>-predict-<model_name>.ipynb/.py
- complete single model training process and prediction. Output single model CV and test set prediction for stackingstack_<stack_method>_<stacking_model_number>_<model_numbers_used>.ipynb
- stacking model from single models in<model_numbers_used>
. Output to final test set prediction for submission.feature_engineering.py
- label encoders and feature scalerskeras_get_best.py
- early stop callback for kerasutilities.py
- utility functions including scoring, feature processing, ..., etcvars_03.py
- feature importance computed by experiments for feature selectionloss_exp.ipynb
- experiment on the smooth hit rate loss functionprice_quantizer*.ipynb
- quantize predicted price used by stack 16 and 18