Description
This project is part of an 8000 level, research intensive course, offered by Dr. Vipin Kumar from the Dept. of CSE, College of Sci. & Engg., UMN in Fall 2022
Retrieving SD and VOD parameters using the recently developed physical models is not desirable due to the lack of universal parameterization of static variables.
Due to the difficulty of the physical model's lack of parameterization, we propose a data-driven approach using cutting-edge machine learning models to overcome this issue in this work.
We aim for the RMSE accuracy of the snow density to be below 30 kg/m3
and the VOD to be below 0.1
in this investigation.
Our hypothesis that it is possible to extract VOD across heavily snow-covered locations in high northern latitudes where the training data is missing which demonstrates the generalizability of our designed machine learning models.
- Data manipulation, cleaning and transforming pixel-wise data
- Ridge Regression
- Decision Tree Regression
- Random Forest regression
- DNN
- LSTM
- CNN
- Scaling and Hyperparameter optimization
- Plotting and visualizations
- Kumawat, Divya (Ph.D. Civil Engineering Department, CEGE, UMN) | GitHub
- Ziebol, Destiny (MS Data Science, Dept. of CSE, CSE, UMN) | GitHub
Convolutional neural networks (CNNs), knowledge-guided machine learning (KGML), exploring feature importance in our models using SHAP or permutation importance, contrasting VOD estimates above 65°N to leaf area index and tree height data, and hyperparameter tuning on our numerous models are some of the techniques we plan to test.
Convolutional neural networks would be a logical progression for this study and a reasonable expansion for comparison (CNNs).
While CNNs have been modified to operate with spatial autocorrelation, LSTMs are built to work with temporal autocorrelation. Further studies might even use a more recent hybrid CNN LSTM model since both are present in our data set.