MusicVideoClipRetrival

This is a project for my Masters degree, for the Multimodal master degree course

In order to run run the application the following steps are necessary:

Include the data folder in the same directory (90 most popular songs in Spotify
Install the libraries in the requirements.txt
To set up the image data for the Resnet-50 model run the frames_set_up.ipynb to create the frames for each music video clip.
To set up the text data for the Bert-uncased model run the lyrics_set_up.ipynb to create both text files that include the lyrics and csv files that include vital information for the content of the music video clips.
Run the resnet_model.ipynb, that will create the trained torch model.
Run the bert_base_model to get the results of the textual model.

Name	Name	Last commit message	Last commit date
Latest commit StamatisOrfanos Add presentation Jul 3, 2023 1021351 · Jul 3, 2023 History 31 Commits
LICENSE	LICENSE	Initial commit	Jun 28, 2023
Music Video Clips Retrieval based on similarity.pdf	Music Video Clips Retrieval based on similarity.pdf	Add presentation	Jul 3, 2023
README.md	README.md	Small fix README.md	Jun 29, 2023
Stamatios_Orfanos_mtn2211.pdf	Stamatios_Orfanos_mtn2211.pdf	Add report	Jul 2, 2023
bert_base_model.ipynb	bert_base_model.ipynb	Add model files	Jul 2, 2023
demo.ipynb	demo.ipynb	Add demo files	Jul 2, 2023
demo_image_audio_text.py	demo_image_audio_text.py	Add demo files	Jul 2, 2023
frames_set_up.ipynb	frames_set_up.ipynb	Fix paths for project set-up files	Jun 30, 2023
lyrics_set_up.ipynb	lyrics_set_up.ipynb	Fix paths for project set-up files	Jun 30, 2023
resnet-50_model.ipynb	resnet-50_model.ipynb	Add model files	Jul 2, 2023

Provide feedback