IMPORTANT: Please setup Hadoop (optional) for distributed file system management as the embedding files can get very big
Project Report Link - https://docs.google.com/document/d/1eKzupUwlNBVfo8zvZM7SBaJVt7C00i4ZdpvMbNbRuJ0/edit
- RoBERTa
- COMET - Commonsense Transformers for Automatic Knowledge Graph Construction
- node2vec
Code for generating the embeddings of RoBERTa, COMET and ndoe2vec are put under their respective markdown cell.
Ensure you run the MELD Dataset Loader, to get the data loaded in correctly.
To make things easier, we have compiled all the embeddings and loaded all the data in a pickle file which can be imported as a dataframe. This will save both time and heavy computation. If required to run on a different dataset, please create a custom data loader which will match the input format of the respective dataset.
- RoBERTa - 768
- COMET - 768
- node2vec - 256 (Customisable in the node2vec embedding generator.