C$^2$KD: Bridging the Modality Gap for Cross-Modal Knowledge Distillation

Code for paper "C$^2$KD: Bridging the Modality Gap for Cross-Modal Knowledge Distillation".

Usage

requirements.txt

Download Original Dataset： CREMA-D, AVE, VGGSound,

For AVE, CREMA-D and VGGSound dataset, we provide code to pre-process videos into RGB frames and audio wav files in the directory utils/data/.

Detailed descriptions of options can be found in main_overlap_tag.py

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
ave		ave
cramed		cramed
vggsound		vggsound
README.md		README.md