🍎EVE (Everything in Video can be Segmented End-to-End) is a simple toy project for automatically or interactively segmenting anything in videos. It combines SAM (Segment Anything) and XMem (Long-Term Video Object Segmentation with an Atkinson-Shiffrin Memory Model): the native SAM encodes and decodes image information to produce single-frame predictions, while XMem equips 🍎EVE with the ability to integrate temporal information. 🍎EVE can be trained end-to-end, so users can easily fine-tune it on their own datasets.
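The rough flow is: SAM's image encoder extracts per-frame features, SAM's mask decoder produces a mask for each frame, and an XMem-style memory carries object information forward in time. A conceptual sketch (the names below are illustrative, not the actual API of this repo):

```python
# Conceptual sketch of the 🍎EVE pipeline (illustrative names, not the repo's actual API):
# SAM handles per-frame encoding/decoding, while an XMem-style memory injects temporal context.

def segment_video(frames, sam_encoder, sam_decoder, memory, first_frame_prompt):
    """frames: list of (3, H, W) image tensors; first_frame_prompt: points/box/mask for frame 0."""
    masks = []
    for t, frame in enumerate(frames):
        feats = sam_encoder(frame.unsqueeze(0))            # single-frame image embedding from SAM
        if t == 0:
            mask = sam_decoder(feats, first_frame_prompt)  # prompt-driven prediction on the first frame
        else:
            readout = memory.read(feats)                   # temporal cue aggregated from past frames
            mask = sam_decoder(feats, readout)             # decode the current frame with temporal guidance
        memory.write(feats, mask)                          # store the new frame/mask pair in the memory bank
        masks.append(mask)
    return masks
```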
- [2023/05/18] We release 🍎EVE!
conda create -n EVE python=3.8 -y
conda activate EVE
# video_demo
pip install gradio
# XMem
conda install pytorch torchvision torchaudio pytorch-cuda=11.7 -c pytorch -c nvidia
pip install opencv-python
pip install -r requirements.txt
# SAM
# The SAM we use is modified;
# do NOT run 'pip install git+https://github.com/facebookresearch/segment-anything.git'
pip install pycocotools matplotlib onnxruntime onnx
# davis2017evaluation
cd davis2017evaluation
python setup.py install
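After installation, you can quickly sanity-check the environment (a minimal sketch; it assumes the modified SAM bundled in this repo still exposes the upstream `segment_anything` package name):

```python
# Quick environment check (assumes the bundled, modified SAM keeps the `segment_anything` package name).
import torch
import cv2
import segment_anything

print("PyTorch:", torch.__version__, "| CUDA available:", torch.cuda.is_available())
print("OpenCV:", cv2.__version__)
print("segment_anything imported from:", segment_anything.__file__)
```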
Please refer to TUTORIALS.md for more details.
If you want to retrain 🍎EVE on VOS datasets (e.g. DAVIS 2017 or YouTube-VOS 2019), you need to structure the datasets (DV17, YV18, YV19) as follows.
DATASETS
├── DAVIS2017
│   ├── Annotations
│   │   └── 480p [150 entries]
│   ├── ImageSets
│   │   ├── 2016
│   │   │   ├── train.txt
│   │   │   └── val.txt
│   │   └── 2017
│   │       ├── test-challenge.txt
│   │       ├── test-dev.txt
│   │       ├── train.txt
│   │       └── val.txt
│   └── JPEGImages
│       └── 480p [150 entries]
└── YoutubeVOS2019
    ├── test
    │   ├── Annotations [541 entries]
    │   ├── JPEGImages [541 entries]
    │   └── meta.json
    ├── train
    │   ├── Annotations [3471 entries]
    │   ├── JPEGImages [3471 entries]
    │   └── meta.json
    └── valid
        ├── Annotations [507 entries]
        ├── JPEGImages [507 entries]
        └── meta.json
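Before launching training, it can help to verify the layout with a small script (a hypothetical check written for this README, not part of the repo; adjust `ROOT` to wherever your DATASETS folder lives):

```python
# Hypothetical sanity check for the dataset layout above (not part of the repo).
from pathlib import Path

ROOT = Path("DATASETS")  # adjust to your dataset root
expected = [
    "DAVIS2017/Annotations/480p",
    "DAVIS2017/ImageSets/2017/train.txt",
    "DAVIS2017/ImageSets/2017/val.txt",
    "DAVIS2017/JPEGImages/480p",
    "YoutubeVOS2019/train/Annotations",
    "YoutubeVOS2019/train/JPEGImages",
    "YoutubeVOS2019/train/meta.json",
    "YoutubeVOS2019/valid/JPEGImages",
]
for rel in expected:
    path = ROOT / rel
    print(f"{'ok' if path.exists() else 'MISSING':>7}  {path}")
```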
Please refer to train_s2.sh for more details.
python sam_scripts/eval_EVE.py \
--eval_on_dv17 \
--model_type vit_h \
--output output/D17_val_EVE \
--model saves/EVE.pth
Performance on DAVIS2017-val
| J&F-Mean | J-Mean | J-Recall | J-Decay | F-Mean | F-Recall | F-Decay |
|---|---|---|---|---|---|---|
| 0.831953 | 0.805476 | 0.893966 | 0.071745 | 0.85843 | 0.929103 | 0.098502 |
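If you want to re-score saved predictions yourself, the bundled davis2017evaluation toolkit can also be driven from Python (a sketch assuming it keeps the upstream `davis2017` API and that the output folder contains one PNG mask sequence per video):

```python
# Re-score saved DAVIS2017-val predictions (assumes the bundled toolkit keeps the upstream API).
import numpy as np
from davis2017.evaluation import DAVISEvaluation

evaluator = DAVISEvaluation(davis_root="DATASETS/DAVIS2017", task="semi-supervised", gt_set="val")
metrics = evaluator.evaluate("output/D17_val_EVE")
J, F = metrics["J"], metrics["F"]
print("J&F-Mean:", (np.mean(J["M"]) + np.mean(F["M"])) / 2)
print("J-Mean:", np.mean(J["M"]), "| F-Mean:", np.mean(F["M"]))
```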
- enable 🍎EVE to delete objects.
- develop a function that uses interactive masks or strokes to guide 🍎EVE.
- enable 🍎EVE to refine masks during intermediate steps of inference.
- create a local interactive_demo.
- train 🍎EVE with the data engine proposed by SAM.
This project is based on XMem and Segment-Anything. The design of demo_video is inspired by SAM-Track. We thank the authors for their outstanding work.