
A PyTorch Implementation of the R-BERT Relation Classification Model


This is an unofficial PyTorch implementation of the R-BERT model described in the paper Enriching Pre-trained Language Model with Entity Information for Relation Classification.

In addition to the SemEval-2010 dataset tested in the original paper, I also test the implementation on the more recent TACRED dataset.
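For orientation, here is a minimal sketch of the model's core idea as described in the paper; it is an illustration, not this repo's actual code, and the class and helper names are assumptions. R-BERT wraps the first entity in '$' markers and the second in '#' markers, then concatenates the [CLS] vector with the averaged hidden states of the two entity spans before classifying:

```python
import torch
import torch.nn as nn
from transformers import BertModel

class RBERTSketch(nn.Module):
    """Illustrative R-BERT: combine [CLS] with averaged entity-span states."""

    def __init__(self, num_labels, model_name="bert-base-uncased", dropout=0.1):
        super().__init__()
        self.bert = BertModel.from_pretrained(model_name)
        hidden = self.bert.config.hidden_size
        self.dropout = nn.Dropout(dropout)
        self.cls_fc = nn.Linear(hidden, hidden)
        self.ent_fc = nn.Linear(hidden, hidden)  # shared by both entities, as in the paper
        self.classifier = nn.Linear(hidden * 3, num_labels)

    def _span_mean(self, states, mask):
        # Average token states over an entity span (mask is 1 inside the span).
        mask = mask.unsqueeze(-1).float()
        return (states * mask).sum(dim=1) / mask.sum(dim=1).clamp(min=1.0)

    def forward(self, input_ids, attention_mask, e1_mask, e2_mask):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        states = out.last_hidden_state  # (batch, seq_len, hidden)
        cls = torch.tanh(self.cls_fc(self.dropout(out.pooler_output)))
        e1 = torch.tanh(self.ent_fc(self.dropout(self._span_mean(states, e1_mask))))
        e2 = torch.tanh(self.ent_fc(self.dropout(self._span_mean(states, e2_mask))))
        return self.classifier(self.dropout(torch.cat([cls, e1, e2], dim=-1)))
```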

Requirements

Install

$ git clone https://github.com/mickeystroller/R-BERT
$ cd R-BERT

Train

SemEval-2010

The SemEval-2010 dataset is already included in this repo and you can directly run:

CUDA_VISIBLE_DEVICES=0 python r_bert.py --config config.ini
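For intuition about the input format, below is a hedged sketch of how a SemEval sentence with <e1>/<e2> tags could be converted into the '$'/'#'-marked form and per-entity span masks; this is not the repo's actual preprocessing, and mark_entities/span_mask are hypothetical helpers:

```python
import re
from transformers import BertTokenizer

def mark_entities(sent):
    # Swap SemEval <e1>...</e1> / <e2>...</e2> tags for R-BERT's markers.
    sent = re.sub(r"</?e1>", " $ ", sent)
    sent = re.sub(r"</?e2>", " # ", sent)
    return " ".join(sent.split())

def span_mask(tokens, marker):
    # 1 for tokens strictly between the two marker occurrences, else 0.
    first, second = [i for i, t in enumerate(tokens) if t == marker]
    return [1 if first < i < second else 0 for i in range(len(tokens))]

tokenizer = BertTokenizer.from_pretrained("bert-base-uncased")
text = mark_entities("The <e1>company</e1> fabricates plastic <e2>chairs</e2>.")
tokens = tokenizer.tokenize(text)   # '$' and '#' survive as standalone tokens
e1_mask = span_mask(tokens, "$")    # offsets for [CLS]/[SEP] omitted for brevity
e2_mask = span_mask(tokens, "#")
```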

TACRED

You first need to download the TACRED dataset from the LDC; due to license restrictions I cannot include it in this repo. Then, you can directly run:

CUDA_VISIBLE_DEVICES=0 python r_bert.py --config config_tacred.ini

Eval

SemEval-2010

We use the official scoring script for SemEval-2010 Task 8:

$ cd eval
$ bash test.sh
$ cat res.txt

TACRED

First, we generate the prediction file tac_res.txt:

$ python eval_tacred.py

You may change the test file and model paths in eval_tacred.py.

Then, we use the official scoring script for the TACRED dataset:

$ python ./eval/score.py -gold_file <TACRED_DIR/data/gold/test.gold> -pred_file ./eval/tac_res.txt

Results

SemEval-2010

Below are the Macro-F1 scores:

| Model              | Original Paper | Ours  |
|--------------------|----------------|-------|
| BERT-uncased-base  | ----           | 88.40 |
| BERT-uncased-large | 89.25          | 90.16 |

TACRED

Below are the evaluation results:

| Model              | Precision (Micro) | Recall (Micro) | F1 (Micro) |
|--------------------|-------------------|----------------|------------|
| BERT-uncased-base  | 72.99             | 62.50          | 67.34      |
| BERT-cased-base    | 71.27             | 64.84          | 67.91      |
| BERT-uncased-large | 72.91             | 66.20          | 69.39      |
| BERT-cased-large   | 70.86             | 65.96          | 68.32      |

References

  1. https://github.com/wang-h/bert-relation-classification

  2. Shanchan Wu and Yifan He. Enriching Pre-trained Language Model with Entity Information for Relation Classification. CIKM 2019.