Few Shot Learning Image Classification (Intro 2 DL 11-785 @ CMU)

We proposed a Weighted-distribution Calibration (WC) to alleviate bias of distribution of novel classes by generating more data from a calibrated distribution, which takes advantage of transferred statistics of all base classes.
With a backbone of ResNet-12 and a logistic regression classifier, WC successfully improves the model's performance from a base accuracy of 57.33% to a surprising accuracy of 63.87%.
We experimented on Vision Transformer (ViT) based backbone feature extractor. We expected ViT to pay more attention to target objects, thus decreasing the redundancy in extracted features. Unfortunately, it did not work well in our experiments, possibly, due to its loss of reception field.

The implementations of backbone networks are adapted from here.

Note due to the size limit, the pretrained model, miniImageNet dataset are not included but they can be downloaded from the following links:

miniImageNet Dataset

File Structure

The config folder contains all .yaml files to configure models, loggers, and miscellaneous parameters.
The core folder contains all the code for initializing models, training, and testing them. In particular train.py and test.py are used for training and testing the ResNet-12 backbone. ViTtrainer.py and ViTtest.py are used for training and testing the ViT backbone.

Steps to run code

Download miniImageNet Dataset and extract the dataset.
Move dataset to a designated location, and match data_root in /config/headers/data.yaml to this location.
install all dependencies in requirements.txt

Run the Pretrained ResNet 12 Baseline

Download the Pretrained ResNet 12 Backbone
Modify PATH variable in run_test.py to point to the downloaded backbone.
python run_test.py

Train the Backbone

Modify configuration in corresponding [backbone_name].yaml file
Start training: python run_trainer.py

Run the Test with Weighted-distribution Calibration (WC)

Modify test configuration in config.yaml file at the root path
Start testing: python run_test.py

Experiments on Vision Transformer-based Feature Extractor

Train the Vision Transformer

Modify configuration in vitconfig.yaml
Start training: python run_vit_trainer.py

Run the Vision Transformer Test with Weighted-distribution Calibration (WC)

Modify test configuration in vitconfig.yaml file at the root path
Start testing: python run_vit_test.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Few Shot Learning Image Classification (Intro 2 DL 11-785 @ CMU)

File Structure

Steps to run code

Run the Pretrained ResNet 12 Baseline

Train the Backbone

Run the Test with Weighted-distribution Calibration (WC)

Experiments on Vision Transformer-based Feature Extractor

Train the Vision Transformer

Run the Vision Transformer Test with Weighted-distribution Calibration (WC)

Files

README.md

Latest commit

History

README.md

File metadata and controls

Few Shot Learning Image Classification (Intro 2 DL 11-785 @ CMU)

File Structure

Steps to run code

Run the Pretrained ResNet 12 Baseline

Train the Backbone

Run the Test with Weighted-distribution Calibration (WC)

Experiments on Vision Transformer-based Feature Extractor

Train the Vision Transformer

Run the Vision Transformer Test with Weighted-distribution Calibration (WC)