OVRE: Open-Vocabulary Video Relation Extraction (AAAI24)

Moments-OVRE is a large-scale video relation dataset containing diverse 3-second videos from Multi-Moments in Time. Action in the video are densely annotated with corresponding relation triplets. We provide Moments-OVRE dataset and baselines in this repo.

Data Preparation

Download annotations to data/.

Raw videos can be download from http://moments.csail.mit.edu/. You can use extract_frames.py to extract frames(~758GB).

Usage

Requirement

git clone https://github.com/Iriya99/OVRE.git && cd OVRE
conda create -n ovre python=3.7
conda activate ovre
pip install -r requirements.txt

Training with fine-tuning of CLIP and GPT2

torchrun --nproc_per_node 8 main.py --epochs 50 --version patch

Testing

You can download checkpoint to checkpoints/.

torchrun --nproc_per_node 1 main.py --load_epoch 50 --bs 1 --mode test --version patch
python evaluation.py --result_path your_result_path

Citation

If you use this code for your research, please cite:

@misc{tian2023openvocabulary,
    title={Open-Vocabulary Video Relation Extraction},
    author={Wentao Tian and Zheng Wang and Yuqian Fu and Jingjing Chen and Lechao Cheng},
    year={2023},
    eprint={2312.15670},
    archivePrefix={arXiv},
    primaryClass={cs.CV}
}

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
fig		fig
results		results
attention.py		attention.py
data.zip		data.zip
evaluation.py		evaluation.py
extract_frames.py		extract_frames.py
main.py		main.py
optim.py		optim.py
readme.md		readme.md
requirements.txt		requirements.txt
transforms.py		transforms.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

OVRE: Open-Vocabulary Video Relation Extraction (AAAI24)

Data Preparation

Usage

Requirement

Training with fine-tuning of CLIP and GPT2

Testing

Citation

About

Releases

Packages

Languages

Iriya99/OVRE

Folders and files

Latest commit

History

Repository files navigation

OVRE: Open-Vocabulary Video Relation Extraction (AAAI24)

Data Preparation

Usage

Requirement

Training with fine-tuning of CLIP and GPT2

Testing

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages