The source code of the NRMS_new model for the Microsoft News Dataset (MIND), inspired by the following papers:
- NRMS -- "Neural News Recommendation with Multi-Head Self-Attention", Chuhan Wu, Fangzhao Wu, Suyu Ge, Tao Qi, Yongfeng Huang, and Xing Xie (EMNLP 2019).
- MIND -- "MIND: A Large-scale Dataset for News Recommendation", Fangzhao Wu, Ying Qiao, Jiun-Hung Chen, Chuhan Wu, Tao Qi, Jianxun Lian, Danyang Liu, Xing Xie, Jianfeng Gao, Winnie Wu, and Ming Zhou (ACL 2020).
- LSTUR -- "Neural News Recommendation with Long- and Short-term User Representations", Mingxiao An, Fangzhao Wu, Chuhan Wu, Kun Zhang, Zheng Liu, and Xing Xie (ACL 2019).
- NAML -- "Neural News Recommendation with Attentive Multi-View Learning", Chuhan Wu, Fangzhao Wu, Mingxiao An, Jianqiang Huang, Yongfeng Huang, and Xing Xie (IJCAI 2019).
The whole model consists of three modules: TextEncoder, NewsEncoder, and NRMS_new. Together they form a hierarchical self-attention and additive-attention structure that simultaneously embeds each user's reading history, each news article in that history, and each component of a news article.
- TextEncoder: the same as the NewsEncoder in NRMS, a combination of multi-head self-attention and an additive attention mechanism that generates an embedding for a text sequence, which can be the title text, abstract text, title entities, or abstract entities. Because it only assumes a sequence of vectors, it also serves as a natural framework for encoding a user's sequence of already-embedded history news into a single vector (see the first sketch after this list).
- NewsEncoder: an additive attention that weights and sums the category, subcategory, title text, abstract text, title entity, and abstract entity embeddings into a single vector per news article (see the second sketch after this list).
- NRMS_new: an additive attention that combines the embeddings of a user's reading history and recently browsed news into a single user vector, which is then dot-multiplied with candidate news embeddings to score them. Negative sampling is used during training: for each news article clicked by a user (a positive sample), K articles shown in the same impression but not clicked are randomly sampled (negative samples). This reformulates click probability prediction as a pseudo (K + 1)-way classification task, and the loss function is the negative log-likelihood of all positive samples (see the loss sketch after this list).
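A minimal PyTorch sketch of the TextEncoder idea, assuming 300-dimensional inputs (e.g., GloVe vectors); the dimensions, head count, and class names here are illustrative, not the repository's actual values:

```python
import torch
import torch.nn as nn

class AdditiveAttention(nn.Module):
    """Collapse a sequence of vectors into one vector via learned attention weights."""
    def __init__(self, dim, query_dim=200):
        super().__init__()
        self.proj = nn.Linear(dim, query_dim)
        self.query = nn.Parameter(torch.randn(query_dim))

    def forward(self, x):                                # x: (batch, seq_len, dim)
        scores = torch.tanh(self.proj(x)) @ self.query   # (batch, seq_len)
        weights = torch.softmax(scores, dim=-1)
        return (weights.unsqueeze(-1) * x).sum(dim=1)    # (batch, dim)

class TextEncoder(nn.Module):
    """Multi-head self-attention over a sequence, then additive-attention pooling."""
    def __init__(self, dim=300, num_heads=15):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(dim, num_heads, batch_first=True)
        self.pool = AdditiveAttention(dim)

    def forward(self, x):                                # x: (batch, seq_len, dim)
        h, _ = self.self_attn(x, x, x)                   # contextualized vectors
        return self.pool(h)                              # one vector per sequence
```

The same module pools a user's sequence of news vectors just as well as a sequence of word vectors, which is why the TextEncoder doubles as the history encoder.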
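The NewsEncoder then reduces to one more additive attention over the six per-view vectors; a sketch reusing the AdditiveAttention class above (the fixed list of views follows the description, and all views are assumed to share one dimension):

```python
class NewsEncoder(nn.Module):
    """Fuse the per-view embeddings of one news article into a single vector."""
    def __init__(self, dim=300):
        super().__init__()
        self.pool = AdditiveAttention(dim)

    def forward(self, views):
        # views: six (batch, dim) tensors -- category, subcategory, title text,
        # abstract text, title entities, abstract entities
        stacked = torch.stack(views, dim=1)              # (batch, 6, dim)
        return self.pool(stacked)                        # (batch, dim)
```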
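For the training objective, with one positive and K negatives per click, the negative log-likelihood of the positive is exactly cross-entropy with target index 0, as in this sketch (the batch layout, with the positive at index 0, is an assumption):

```python
import torch
import torch.nn.functional as F

def nce_loss(user_vec, cand_vecs):
    # user_vec: (batch, dim); cand_vecs: (batch, K + 1, dim),
    # where index 0 holds the clicked news and indices 1..K the sampled negatives
    scores = torch.bmm(cand_vecs, user_vec.unsqueeze(-1)).squeeze(-1)  # (batch, K + 1)
    target = torch.zeros(scores.size(0), dtype=torch.long, device=scores.device)
    return F.cross_entropy(scores, target)               # -log softmax of the positive
```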
- Set up three folders in the same directory:
- 'MINDlarge_train' for training data
- 'MINDlarge_dev' for validation data
- 'glove' for the pretrained GloVe embeddings
- Data preprocessing:
  - Generate the look-up table from the pretrained GloVe embeddings (sketched after this list):
    - python glove/generate_glove_dict.py
  - Preprocess users' reading histories and sequentialize them (also sketched after this list):
    - python data_preprocess/behavior_preprocess.py
  - Preprocess news features:
    - python data_preprocess/news_preprocess.py
  - Generate the test set:
    - python data_preprocess/behavior_preprocess_evaluation.py
  - (The last three commands can be run in parallel.)
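A sketch of what the GloVe look-up step does, assuming the standard glove.840B.300d.txt format (one token followed by 300 floats per line); the file names and pickle output here are assumptions, not the script's actual interface:

```python
import pickle
import numpy as np

def build_glove_dict(glove_path="glove/glove.840B.300d.txt",
                     out_path="glove/glove_dict.pkl"):
    """Parse the raw GloVe text file into a {word: vector} look-up table."""
    embeddings = {}
    with open(glove_path, encoding="utf-8") as f:
        for line in f:
            parts = line.rstrip().split(" ")
            # a few 840B tokens contain spaces, so take the last 300 fields as the vector
            word = " ".join(parts[:-300])
            embeddings[word] = np.asarray(parts[-300:], dtype=np.float32)
    with open(out_path, "wb") as f:
        pickle.dump(embeddings, f)
```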
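The behavior preprocessing step can be pictured as follows, based on the documented MIND behaviors.tsv layout (impression ID, user ID, time, space-separated history, and space-separated newsID-label pairs); the output sample structure and the value of K are illustrative assumptions:

```python
import random

K = 4  # negatives per positive; the actual value is a hyperparameter

def parse_behaviors(path="MINDlarge_train/behaviors.tsv"):
    """Turn each click into a (history, positive, K negatives) training sample."""
    samples = []
    with open(path, encoding="utf-8") as f:
        for line in f:
            _imp_id, _user_id, _time, history, impressions = line.rstrip("\n").split("\t")
            hist = history.split()
            shown = [imp.rsplit("-", 1) for imp in impressions.split()]
            clicked = [nid for nid, label in shown if label == "1"]
            unclicked = [nid for nid, label in shown if label == "0"]
            for pos in clicked:
                if len(unclicked) >= K:
                    samples.append((hist, pos, random.sample(unclicked, K)))
    return samples
```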
- Model training:
  - Set the data file paths and hyperparameters in src/utils.py.
  - Train the model: python src/main.py
- Generate the ranking list of news for the test set (candidate scoring sketched below):
  - python src/generate_recs.py
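At test time, ranking a candidate list is just sorting by the user-news dot product; a minimal sketch (the helper below is hypothetical, not the repository's function):

```python
import torch

def rank_candidates(user_vec, cand_vecs):
    # user_vec: (dim,); cand_vecs: (num_candidates, dim)
    scores = cand_vecs @ user_vec                # dot-product click scores
    order = torch.argsort(scores, descending=True)
    return order.tolist()                        # candidate indices, best first
```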
contact: Weijie Jiang, jiangwj[at]berkeley.edu