Mitigating Unwanted Biases in Word Embeddings with Adversarial Learning using PyTorch

PyTorch implementation of "Mitigating Unwanted Biases in Word Embeddings with Adversarial Learning". Adapted from https://colab.research.google.com/notebooks/ml_fairness/adversarial_debiasing.ipynb, which is written with TensorFlow.

Large parts of the data processing code and documentation are copied directly from https://colab.research.google.com/notebooks/ml_fairness/adversarial_debiasing.ipynb. Both https://colab.research.google.com/notebooks/ml_fairness/adversarial_debiasing.ipynb and this repository implement an experiment from "Mitigating Unwanted Biases with Adversarial Learning". One way in which this code differs from the original implementation is that it uses two-means to compute the binary gender bias direction instead of PCA.

To run this code, simply execute python3 adversarial_bias_mitigation.py.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
LICENSE		LICENSE
README.md		README.md
adversarial_bias_mitigation.py		adversarial_bias_mitigation.py
utils.py		utils.py

Provide feedback