Lesson 1: Introduction to Machine Learning
- Using scikit-learn on the Iris dataset.
- Binary classification, multiclass classification.
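A minimal sketch of the lesson's first exercise, assuming scikit-learn is installed (the model and split parameters are illustrative, not prescribed by the course):

```python
# Multiclass classification on the Iris dataset with scikit-learn.
from sklearn.datasets import load_iris
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = load_iris(return_X_y=True)
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)

clf = LogisticRegression(max_iter=1000)  # handles the 3 classes natively
clf.fit(X_train, y_train)
accuracy = clf.score(X_test, y_test)     # held-out accuracy
```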
Lesson 2: Linear classifiers and stochastic gradient descent
- Stochastic gradient descent in practice.
- Maximum likelihood and L1/L2 regularization.
- Finding the optimal regularization coefficient with leave-one-out (LOO) cross-validation.
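The topics above can be sketched with a toy NumPy implementation: logistic regression trained by stochastic gradient descent with an L2 penalty (data and hyperparameters are made up for illustration):

```python
import numpy as np

# SGD for logistic regression with L2 regularization on synthetic 2-D data.
rng = np.random.default_rng(0)
n = 200
X = rng.normal(size=(n, 2))
y = (X[:, 0] + X[:, 1] > 0).astype(float)  # linearly separable labels

w = np.zeros(2)
b = 0.0
lr, lam = 0.1, 0.01  # learning rate and L2 coefficient

for epoch in range(20):
    for i in rng.permutation(n):
        z = X[i] @ w + b
        p = 1.0 / (1.0 + np.exp(-z))        # sigmoid
        grad = (p - y[i]) * X[i] + lam * w  # gradient of regularized log-loss
        w -= lr * grad
        b -= lr * (p - y[i])                # bias is not regularized

pred = (X @ w + b > 0).astype(float)
accuracy = (pred == y).mean()
```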
Lesson 3: Neural Networks: Gradient Optimization Techniques
- Autograd.
- MLP for MNIST.
- Tuning hyperparameters for MLP.
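As a tiny stand-in for the autograd/MLP material, here is an MLP with hand-written backpropagation on XOR; the architecture, learning rate, and step count are illustrative (the lesson itself uses a framework's autograd and MNIST):

```python
import numpy as np

# A 2-8-1 MLP trained by manual backprop on the XOR toy problem.
rng = np.random.default_rng(1)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1 = rng.normal(size=(2, 8)); b1 = np.zeros(8)
W2 = rng.normal(size=(8, 1)); b2 = np.zeros(1)
lr = 0.3

def forward(X):
    h = np.tanh(X @ W1 + b1)
    out = 1.0 / (1.0 + np.exp(-(h @ W2 + b2)))  # sigmoid output
    return h, out

def bce(out):
    eps = 1e-9  # numerical safety for the log
    return -np.mean(y * np.log(out + eps) + (1 - y) * np.log(1 - out + eps))

_, out0 = forward(X)
loss0 = bce(out0)
for step in range(5000):
    h, out = forward(X)
    d_out = (out - y) / len(X)            # dLoss/dlogit for sigmoid + BCE
    dW2 = h.T @ d_out; db2 = d_out.sum(0)
    d_h = (d_out @ W2.T) * (1 - h**2)     # backprop through tanh
    dW1 = X.T @ d_h; db1 = d_h.sum(0)
    W1 -= lr * dW1; b1 -= lr * db1
    W2 -= lr * dW2; b2 -= lr * db2

_, out = forward(X)
loss = bce(out)  # should have decreased from loss0
```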
Lesson 4: Metric classification and regression methods
- kNN, kernel-kNN, Parzen window method, potential function method.
- Reference object selection, the STOLP algorithm, the Nadaraya-Watson formula.
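Two of the methods above fit in a few lines of NumPy: kNN majority voting and Nadaraya-Watson kernel regression (the 1-D toy data and bandwidth are illustrative):

```python
import numpy as np

# kNN classification and Nadaraya-Watson kernel regression on 1-D toy data.
def knn_predict(X_train, y_train, x, k=3):
    idx = np.argsort(np.abs(X_train - x))[:k]   # k nearest neighbours
    return np.bincount(y_train[idx]).argmax()   # majority vote

def nadaraya_watson(X_train, y_train, x, h=0.5):
    w = np.exp(-((X_train - x) / h) ** 2 / 2)   # Gaussian kernel weights
    return (w * y_train).sum() / w.sum()        # weighted average of targets

X = np.linspace(0, 5, 50)
y_cls = (X > 2.5).astype(int)   # step labels for classification
y_reg = np.sin(X)               # smooth target for regression

label = knn_predict(X, y_cls, 4.0)
estimate = nadaraya_watson(X, y_reg, 2.0)  # should be close to sin(2.0)
```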
Lesson 5: Support vector machine
- SVM and kernel SVM for classification and regression.
- Distinctive properties of SVM.
Lesson 6: Multidimensional Linear Regression
- Multidimensional linear regression, SVD, and regularization for MLR via SVD.
- Dependence of the approximation quality on the condition number.
- PCA on MNIST.
- PCA for images.
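PCA via SVD, as used in this lesson, can be sketched in NumPy (the synthetic 2-D data is illustrative; the lesson applies the same steps to MNIST and images):

```python
import numpy as np

# PCA via SVD: project centered data onto top components and reconstruct.
rng = np.random.default_rng(0)
# Correlated 2-D data, stretched along one axis so one component dominates.
X = rng.normal(size=(200, 2)) @ np.array([[3.0, 0.0], [0.0, 0.5]])

Xc = X - X.mean(axis=0)                    # center the data
U, S, Vt = np.linalg.svd(Xc, full_matrices=False)
explained = S**2 / (S**2).sum()            # variance ratio per component

Z = Xc @ Vt[:1].T                          # project onto first component
X_rec = Z @ Vt[:1] + X.mean(axis=0)        # rank-1 reconstruction
err = np.mean((X - X_rec) ** 2)            # small: little variance discarded
```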
Lesson 7: Nonlinear Regression
- Non-linear regression example.
- Comparison of gradient descent, Newton-Raphson, and Gauss-Newton.
- Generalized linear models: optimal sample size.
- Loss function for the problem of finding close sentences.
- Convergence visualization for the Newton-Raphson method and stochastic gradient descent.
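The gradient-descent versus Newton-Raphson comparison can be reproduced on a made-up smooth convex function (the function and step size are illustrative; Newton needs far fewer iterations because it uses curvature):

```python
import math

# Minimize f(w) = log(1 + e^{-w}) + (w - 1)^2 / 2 two ways, counting steps.
def grad(w):
    return (w - 1.0) - 1.0 / (1.0 + math.exp(w))

def hess(w):
    s = 1.0 / (1.0 + math.exp(-w))
    return 1.0 + s * (1.0 - s)   # always >= 1, so f is strongly convex

def run(step_rule, w=5.0, tol=1e-10, max_iter=10000):
    for it in range(max_iter):
        g = grad(w)
        if abs(g) < tol:
            return w, it
        w -= step_rule(w, g)
    return w, max_iter

w_gd, it_gd = run(lambda w, g: 0.1 * g)      # fixed-step gradient descent
w_nr, it_nr = run(lambda w, g: g / hess(w))  # Newton-Raphson step
```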
Lesson 8: Model Selection Criteria and Feature Selection Methods
- Model quality assessment: external and internal criteria.
- Feature selection: exhaustive search, Add algorithm, Add-Del algorithm.
- Precision and recall.
- Example of information retrieval task.
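Precision and recall reduce to counting the confusion-matrix cells; a worked toy example (labels invented for illustration):

```python
# Precision and recall from predicted and true binary labels.
y_true = [1, 1, 1, 0, 0, 0, 1, 0]
y_pred = [1, 0, 1, 0, 1, 0, 1, 0]

tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)  # hits
fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)  # false alarms
fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)  # misses

precision = tp / (tp + fp)  # 3 / 4
recall = tp / (tp + fn)     # 3 / 4
```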
Lesson 9: Logical classification methods
- Logical classifier implementation.
- Informativeness criteria.
- Decision list, simple implementation.
- Decision tree.
- Random forest.
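The core of a decision tree is choosing the most informative split; a one-level tree (stump) that minimizes weighted entropy illustrates the idea on made-up data:

```python
import math

# Decision stump: pick the threshold minimizing weighted leaf entropy.
def entropy(labels):
    n = len(labels)
    if n == 0:
        return 0.0
    p = sum(labels) / n
    if p in (0.0, 1.0):
        return 0.0   # pure leaf
    return -p * math.log2(p) - (1 - p) * math.log2(1 - p)

def best_split(xs, ys):
    pairs = sorted(zip(xs, ys))
    best_t, best_h = None, float("inf")
    for i in range(1, len(pairs)):
        t = (pairs[i - 1][0] + pairs[i][0]) / 2   # midpoint candidate
        left = [y for x, y in pairs if x <= t]
        right = [y for x, y in pairs if x > t]
        h = (len(left) * entropy(left) + len(right) * entropy(right)) / len(pairs)
        if h < best_h:
            best_t, best_h = h and t or t, h
    return best_t

xs = [1.0, 2.0, 3.0, 10.0, 11.0, 12.0]
ys = [0, 0, 0, 1, 1, 1]
threshold = best_split(xs, ys)  # the gap between the two groups, 6.5
```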
Lesson 10: Search for association rules
- Problem statement for association rule mining.
- Synthetic example.
- An example with real data from Kaggle.
- Apriori algorithm.
- FP-growth algorithm.
- Generalization for real data.
- Generalized association rules.
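The Apriori idea (grow candidate itemsets level by level, keeping only those with sufficient support) fits in a short sketch; the transaction data and support threshold are invented:

```python
# Apriori sketch: frequent itemsets above an absolute support threshold.
transactions = [
    {"bread", "milk"},
    {"bread", "butter"},
    {"bread", "milk", "butter"},
    {"milk", "butter"},
    {"bread", "milk", "butter"},
]
min_support = 3  # absolute count

def support(itemset):
    return sum(1 for t in transactions if itemset <= t)

items = sorted({i for t in transactions for i in t})
frequent = {frozenset([i]) for i in items if support(frozenset([i])) >= min_support}
level = set(frequent)
while level:
    size = len(next(iter(level))) + 1
    # Candidates: unions of frequent sets from the previous level.
    candidates = {a | b for a in level for b in level if len(a | b) == size}
    level = {c for c in candidates if support(c) >= min_support}
    frequent |= level
```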
Lesson 11: Linear Ensembles
- DummyEnsemble.
- AdaBoost.
- Gradient boosting, XGBoost.
- An example with real data from Kaggle.
- RandomForest.
- Mixture of Experts.
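Gradient boosting for squared loss can be sketched with regression stumps: each round fits a stump to the current residuals and adds it with shrinkage (the 1-D data and rates are illustrative, not XGBoost itself):

```python
import numpy as np

# Gradient boosting with stumps on a noisy 1-D regression problem.
rng = np.random.default_rng(0)
X = np.linspace(0, 4, 80)
y = np.sin(X) + rng.normal(scale=0.1, size=80)

def fit_stump(x, r):
    best = None
    for t in x:
        left, right = r[x <= t], r[x > t]
        if len(left) == 0 or len(right) == 0:
            continue
        sse = ((left - left.mean())**2).sum() + ((right - right.mean())**2).sum()
        if best is None or sse < best[0]:
            best = (sse, t, left.mean(), right.mean())
    _, t, lv, rv = best
    return lambda q: np.where(q <= t, lv, rv)  # piecewise-constant predictor

pred = np.full_like(y, y.mean())   # start from the constant model
lr = 0.3                           # shrinkage
mse0 = ((y - pred)**2).mean()
for _ in range(50):
    stump = fit_stump(X, y - pred) # fit stump to residuals
    pred = pred + lr * stump(X)
mse = ((y - pred)**2).mean()       # much lower than the constant baseline
```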
Lesson 12: Advanced Ensembling Techniques
- ComBoost.
- Gradient Boosting.
- XGBoost.
- CatBoost.
Lesson 13: Bayesian theory of classification
- Maximum Likelihood Principle: Visualization.
- Density reconstruction from empirical data.
- Using LOO to select the window width.
- Naive Bayes classifier.
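The naive Bayes classifier above reduces to per-class, per-feature density estimates plus class priors; a Gaussian version on synthetic blobs (all data invented):

```python
import numpy as np

# Gaussian naive Bayes: independent normal densities per class and feature.
rng = np.random.default_rng(0)
X0 = rng.normal(loc=[0, 0], scale=1.0, size=(100, 2))
X1 = rng.normal(loc=[4, 4], scale=1.0, size=(100, 2))
X = np.vstack([X0, X1])
y = np.array([0] * 100 + [1] * 100)

stats = {}
for c in (0, 1):
    Xc = X[y == c]
    stats[c] = (Xc.mean(0), Xc.var(0) + 1e-9, len(Xc) / len(X))

def log_posterior(x, c):
    mu, var, prior = stats[c]
    # Sum of per-feature Gaussian log-likelihoods, by independence.
    ll = -0.5 * np.sum(np.log(2 * np.pi * var) + (x - mu)**2 / var)
    return ll + np.log(prior)

def predict(x):
    return max((0, 1), key=lambda c: log_posterior(x, c))

pred = np.array([predict(x) for x in X])
accuracy = (pred == y).mean()
```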
Lesson 14: Clustering and semi-supervised learning
- Clustering examples.
- K-means.
- DBSCAN.
- Hierarchical clustering.
- Semi-supervised learning.
- Self-training (an approach dating back to the 1970s).
- Unlabeled data in deep learning.
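K-means (Lloyd's algorithm) alternates assignment and centroid updates; a minimal sketch on two synthetic blobs, with a deliberately simple initialization of one point per blob (in practice you would use random restarts or k-means++):

```python
import numpy as np

# Lloyd's k-means on two well-separated Gaussian blobs.
rng = np.random.default_rng(0)
A = rng.normal(loc=[0, 0], scale=0.5, size=(50, 2))
B = rng.normal(loc=[5, 5], scale=0.5, size=(50, 2))
X = np.vstack([A, B])

k = 2
centers = X[[0, 50]].copy()  # one seed point from each blob (simplification)
for _ in range(100):
    d = np.linalg.norm(X[:, None] - centers[None, :], axis=2)  # (n, k)
    labels = d.argmin(axis=1)                                  # assignment step
    new_centers = np.array([X[labels == j].mean(axis=0) for j in range(k)])
    if np.allclose(new_centers, centers):
        break                                                  # converged
    centers = new_centers
```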
Lesson 15: Deep Neural Networks
- CNN, RNN, Tensorboard, Transfer Learning, Interpretability of NN.
Lesson 16: Autoencoders and GANs
- Autoencoder, Linear Autoencoder, Autoencoder using CNN, Variational autoencoder.
- Transfer learning from a pre-trained model.
- Generative adversarial networks.
Lesson 17: Tokenization, Word2Vec, FastText
- An example of classifying tweets.
- Text tokenization.
- Word2Vec (based on the FastText model).
- FastText model (compressed to embedding dimension 10 to keep it lightweight).
- Problems for unsupervised learning of vectorization models.
Lesson 18: Attention and Transformers
- Attention model RNN.
- Transformer.
- T2T translator.
- BPE tokenization.
- BERT.
- LaBSE.
Lesson 19: Topic Modeling
- LDA.
- PLSA (BigARTM).
Lesson 20: Homework
Lesson 21: Learning to rank
- Basic concepts.
- An example of a ranking problem.
- An example of a recommender system.
- Training a search engine based on pyserini.
Lesson 22: Recommender Systems
- Constant model.
- Correlation-based recommender.
- SLIM.
- SVD.
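The SVD approach to recommendation can be sketched as a low-rank approximation of a user-item rating matrix, using the reconstruction as predicted scores (the tiny matrix and rank are invented for illustration):

```python
import numpy as np

# SVD recommender sketch: rank-2 approximation of a users x items matrix.
R = np.array([
    [5, 4, 0, 1],
    [4, 5, 1, 0],
    [1, 0, 5, 4],
    [0, 1, 4, 5],
], dtype=float)   # toy ratings: two taste groups, two item groups

U, S, Vt = np.linalg.svd(R, full_matrices=False)
k = 2
R_hat = (U[:, :k] * S[:k]) @ Vt[:k]   # rank-2 reconstruction = predicted scores

err = np.abs(R - R_hat).max()         # bounded by the first dropped singular value
```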
Lesson 23: Time Series Analysis
- Autoregressive model.
- Exponential smoothing.
- Cluster analysis of time series.
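Simple exponential smoothing is a one-line recurrence, s_t = alpha * y_t + (1 - alpha) * s_{t-1}; a worked example on an invented series:

```python
# Simple exponential smoothing of a short series.
def exp_smooth(series, alpha):
    s = [series[0]]                          # initialize with the first value
    for y in series[1:]:
        s.append(alpha * y + (1 - alpha) * s[-1])
    return s

series = [10.0, 12.0, 11.0, 13.0, 12.0]
smoothed = exp_smooth(series, alpha=0.5)
# -> [10.0, 11.0, 11.0, 12.0, 12.0]
```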
Lesson 24: Online Learning
Lesson 25: Reinforcement Learning
- Stationary multi-armed bandit.
- Non-stationary multi-armed bandit.
- Swim problem.
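The stationary multi-armed bandit can be sketched with an epsilon-greedy policy over invented Bernoulli arms (arm probabilities, epsilon, and horizon are illustrative):

```python
import random

# Epsilon-greedy on a stationary Bernoulli multi-armed bandit.
random.seed(0)
true_probs = [0.2, 0.5, 0.8]   # arm 2 pays off most often
counts = [0, 0, 0]
values = [0.0, 0.0, 0.0]       # running mean reward per arm
epsilon = 0.1

for t in range(5000):
    if random.random() < epsilon:
        arm = random.randrange(3)            # explore a random arm
    else:
        arm = values.index(max(values))      # exploit current best estimate
    reward = 1.0 if random.random() < true_probs[arm] else 0.0
    counts[arm] += 1
    values[arm] += (reward - values[arm]) / counts[arm]  # incremental mean

best_arm = values.index(max(values))         # should identify arm 2
```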
Lesson 26: Active Learning
- Active learning with random selection of the next element.
- Active learning selecting the element with the maximum variance.
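The maximum-variance strategy can be sketched with bootstrap disagreement: refit a simple model on resamples and query the pool point where predictions vary most (the labeled data, pool points, and linear model are all invented for illustration):

```python
import numpy as np

# Active-learning sketch: query the pool point with maximum predictive
# variance across bootstrap refits of a linear model.
rng = np.random.default_rng(0)
X_lab = np.arange(8.0)
y_lab = 0.9 * X_lab + rng.normal(scale=0.3, size=8)   # noisy labeled data
pool = np.array([0.5, 3.5, 6.5])                      # unlabeled candidates

preds = []
for _ in range(100):
    idx = rng.integers(0, len(X_lab), size=len(X_lab))  # bootstrap resample
    a, b = np.polyfit(X_lab[idx], y_lab[idx], 1)        # refit the line
    preds.append(a * pool + b)
variance = np.array(preds).var(axis=0)    # disagreement per candidate
query = pool[variance.argmax()]           # most uncertain point to label next
```

For a linear model the predictive variance grows away from the mean of the labeled inputs, so the selected point should be one of the extremes rather than the center.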