Background
- This is the capstone project for Udacity Machine Learning Engineer Nanodegree. Series of ensemble models are fitted to evaluate probabilities of device failures for purpose of predictive maintenance. Resampling technique is also used to mitigate imbalanced nature of the dataset. Model performnace is continuously calibrated with help of confusion matrix. In the final analysis, the best performing model is able to score 90% precision and 90% recall in the validation dataset.
Dataset
Software Requirements
conda install -c anaconda numpy pandas matplotlib seaborn scikit-learn -y
conda install -c conda-forge imbalanced-learn xgboost lightgbm -y
Files
proposal.pdf
: initial workplanreport.pdf
: final reportUdacity_ML_Capstone_Analysis.ipynb
: detail steps of the analysis captured in jupyter notebook