This repo builds on the Hugging Face Transformers library and on CheckList behavioral testing (ACL 2020). It uses the pipelines provided in transformers, extended to support batched inputs, and it extends both transformers and checklist to run efficiently in a single pipeline, using data parallelism across multiple GPUs and caching of hidden layers.
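For illustration, here is a minimal sketch (not the repo's actual code) of the underlying idea: batched inference with a transformers model replicated across GPUs via `torch.nn.DataParallel`. The checkpoint name and sentences are examples only.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Example checkpoint; any sequence-classification checkpoint works the same way.
model_name = "distilbert-base-uncased-finetuned-sst-2-english"
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForSequenceClassification.from_pretrained(model_name).eval()

device = "cuda" if torch.cuda.is_available() else "cpu"
if torch.cuda.device_count() > 1:
    # Replicate the model and split each batch across the visible GPUs.
    model = torch.nn.DataParallel(model)
model.to(device)

sentences = ["This movie was great.", "This movie was not great."]
batch = tokenizer(sentences, padding=True, return_tensors="pt").to(device)

with torch.no_grad():
    probs = model(**batch).logits.softmax(dim=-1)
print(probs)
```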
The purpose of this package is to enable ablation studies (for now) of NLP models on CheckList tasks. The environment currently supports:
- Models from transformers, such as BERT, RoBERTa, DistilBERT, and more
- Aggregation per layer, per timestep, and per neuron, with ablation by zeroing out neurons (and other methods); see the sketch after the task list below
Available CheckList tasks include:
Negation, Vocabulary, Taxonomy, Robustness, NER, Fairness, Temporal, Coref, SRL, Logic
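As an illustration of the zeroing-out ablation mentioned above, the sketch below clamps a single hidden activation to zero with a PyTorch forward hook. The model, layer index, timestep, and neuron index are illustrative and this is not the repo's internal API.

```python
import torch
from transformers import AutoModel, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModel.from_pretrained("bert-base-uncased").eval()

# Which activation to ablate (example indices, not defaults of this repo).
layer_idx, timestep, neuron = 6, 3, 17

def zero_neuron(module, inputs, output):
    # output[0] holds the layer's hidden states of shape (batch, seq_len, hidden_dim).
    hidden = output[0].clone()
    hidden[:, timestep, neuron] = 0.0
    return (hidden,) + output[1:]

# Attach the hook to one encoder layer, run the model, then remove the hook.
handle = model.encoder.layer[layer_idx].register_forward_hook(zero_neuron)
with torch.no_grad():
    batch = tokenizer("The food was not good.", return_tensors="pt")
    ablated_states = model(**batch).last_hidden_state
handle.remove()
```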
- Install checklist from source:
git clone [email protected]:marcotcr/checklist.git
cd checklist
pip install -e .
- Clone this package, cd into it, and install:
git clone https://github.com/berryweinst/ChecklistTransformers.git
cd ChecklistTransformers
pip install -e .
- Copy the test-suite tarball (release_data.tar.gz) from checklist into this package and extract it:
cp -rf ../checklist/release_data.tar.gz ./checklist_transformers/
cd checklist_transformers
tar xvzf release_data.tar.gz
cd ../
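Once extracted, the suites can be loaded with CheckList's own API. A minimal sketch is below; the exact file path is an assumption, so check the extracted release_data directory for the actual suite file names.

```python
from checklist.test_suite import TestSuite

# Path is an example; adjust to whichever suite file exists under release_data/.
suite_path = "checklist_transformers/release_data/sentiment/sentiment_suite.pkl"
suite = TestSuite.from_file(suite_path)
print(list(suite.tests.keys()))  # individual tests contained in the suite
```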
For an efficient multi-GPU run on a subset of neurons:
python main.py --device-ids 0 1 2 3 --batch-size 1996 --study-num-layer 6 --study-num-ts 16 --study-num-neurons 32
To run on all neurons, omit the --study-* flags.
The main script outputs two heatmaps over the neurons: the error rate on the CheckList task and the confidence score, measured when each neuron is ablated individually, indexed by layer, timestep, and position inside the hidden dimension. An example Jupyter notebook run on a subset of the neurons can be found here.
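For completeness, a hypothetical sketch of rendering one such heatmap yourself is shown below; the file name and array layout are assumptions, not guaranteed outputs of main.py.

```python
import numpy as np
import matplotlib.pyplot as plt

# Hypothetical file name and array layout; adjust to whatever main.py actually saves.
error_rate = np.load("error_rate.npy")  # e.g. shape (num_layers, num_neurons)
plt.imshow(error_rate, aspect="auto", cmap="viridis")
plt.xlabel("neuron")
plt.ylabel("layer")
plt.colorbar(label="error rate on the CheckList task")
plt.savefig("error_rate_heatmap.png")
```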