GitHub - JiachengZuo/VLM-C3L

VLM-C3L: Vision Language Model Driven Corner Case Datasets Optimization and Continuous Learning for Autonomous Driving

Installation

The codebases are built on top of Detectron2 and DETR.

Requirements

Linux or macOS with Python ≥ 3.6
PyTorch ≥ 1.5 and torchvision that matches the PyTorch installation. You can install them together at pytorch.org to make sure of this
OpenCV is optional and needed by demo and visualization

Steps

Install and build libs

cd SparseR-CNN
python setup.py build develop

Link coco dataset path to SparseR-CNN/datasets/coco

mkdir -p datasets/coco
ln -s /path_to_coco_dataset/annotations datasets/coco/annotations
ln -s /path_to_coco_dataset/train2017 datasets/coco/train2017
ln -s /path_to_coco_dataset/val2017 datasets/coco/val2017

Install CLIP

$ conda install --yes -c pytorch pytorch=1.7.1 torchvision cudatoolkit=11.0
$ pip install ftfy regex tqdm
mkdir un-LLM
cd un-LLM
$ pip install git+https://github.com/openai/CLIP.git

Extract Core Data

python un-LLM/core.py --num-gpus 1 \
    --config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml \
    --eval-only MODEL.WEIGHTS pth_path

Extract Corner Case Data

python un-LLM/CLIP/coda_classfiy_data_aug.py

Merge Corner Case and Core Data

python un-LLM/merge.py

Train SparseR-CNN

python projects/SparseRCNN/train_net.py --num-gpus 8 \
    --config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml

Evaluate SparseR-CNN

python projects/SparseRCNN/train_net.py --num-gpus 8 \
    --config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml \
    --eval-only MODEL.WEIGHTS path/to/model.pth

Visualize SparseR-CNN

python demo/demo.py\
    --config-file projects/SparseRCNN/configs/sparsercnn.res50.100pro.3x.yaml \
    --input path/to/images --output path/to/save_images --confidence-threshold 0.4 \
    --opts MODEL.WEIGHTS path/to/model.pth

Third-party resources

mmdetection implementation: sparse_rcnn. Thank Shilong Zhang!
cvpod implementation:sparse_rcnn. Thank Benjin Zhu!
paddledetection implementation:sparse_rcnn. Thank FL77N!

License

SparseR-CNN is released under MIT License.

Citing

If you use SparseR-CNN in your research or wish to refer to the baseline results published here, please use the following BibTeX entries:

@article{peize2020sparse,
  title   =  {{SparseR-CNN}: End-to-End Object Detection with Learnable Proposals},
  author  =  {Peize Sun and Rufeng Zhang and Yi Jiang and Tao Kong and Chenfeng Xu and Wei Zhan and Masayoshi Tomizuka and Lei Li and Zehuan Yuan and Changhu Wang and Ping Luo},
  journal =  {arXiv preprint arXiv:2011.12450},
  year    =  {2020}
}

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
configs		configs
datasets		datasets
demo		demo
demo_d2		demo_d2
detectron2		detectron2
dev		dev
docker		docker
docs		docs
projects		projects
readme		readme
tests		tests
tools		tools
un-LLM		un-LLM
GETTING_STARTED.md		GETTING_STARTED.md
INSTALL.md		INSTALL.md
LICENSE		LICENSE
MODEL_ZOO.md		MODEL_ZOO.md
README.md		README.md
README_d2.md		README_d2.md
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

VLM-C3L: Vision Language Model Driven Corner Case Datasets Optimization and Continuous Learning for Autonomous Driving

Installation

Requirements

Steps

Third-party resources

License

Citing

About

Releases

Packages

Languages

License

JiachengZuo/VLM-C3L

Folders and files

Latest commit

History

Repository files navigation

VLM-C3L: Vision Language Model Driven Corner Case Datasets Optimization and Continuous Learning for Autonomous Driving

Installation

Requirements

Steps

Third-party resources

License

Citing

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages