What is your favorite gender MLM?: Gender Bias Evaluation in Multilingual Masked Language Models

This GitHub page consists of the dataset and implementation of our paper "What is your favorite gender MLM?: Gender Bias Evaluation in Multilingual Masked Language Models." Our work distinguishes itself from other works through its unique features and characteristics such as:

Strengths

It provides a multi-lingual gender lexicon in English, German, Spanish, Portuguese, and Chinese.
It evaluates the gender bias of language models on any corpus in these five languages.
The evaluation corpus and the language model can be easily altered to assess gender bias.

Guideline

Multilingual Gender Lexicon

MGL in five languages, English, German, Spanish, Portuguese, and Chinese are within "eval_words" folder in the repository.
Encoded as a pickle file, each file is classified with respect to gender and language.
In generating the pairs of sentences for evaluating gender bias of language models, each file is required as input.

Lexicon_based and Model_based Sentence Extraction

Given this MGL from eval_words folder, lexicon_based and model_based sentence extraction is conducted through "extract.py" file.
Within this file, one can change the evaluation corpus by modifying the arguments of this Python file.
The required arguments to pass are the language of the corpus(model), the male gender lexicon, the female gender lexicon, and the corpus.
This file first tokenizes the corpus, extracts the sentences containing the gendered word, generates the sentences, and writes the sentences in pickle format.
One can also use Jupyter Notebook to make the sentence that is shown in "extraction_chn.ipynb" file.
An illustration of how this pipeline works is shown in the main function of "extract.py" file.

Multilingual Bias Evaluation Metrics

Using the sentences, Strict Bias Metrics that quantify gender bias of language models can be evaluated in "MBE_Calculation.ipynb" file.
With the size of our corpus being approximately 30,000 sentences for each language, our evaluation for each language took less than 10 minutes for each language.

Contact

Jeongrok Yu

Name		Name	Last commit message	Last commit date
Latest commit History 51 Commits
eval_words		eval_words
sentence		sentence
wordlist		wordlist
German -Final.csv		German -Final.csv
MBE_Calcuation.ipynb		MBE_Calcuation.ipynb
Portuguese - Final.csv		Portuguese - Final.csv
README.md		README.md
Spanish - Final.csv		Spanish - Final.csv
extract.py		extract.py
extraction_de.ipynb		extraction_de.ipynb
female_word_file.txt		female_word_file.txt
lexicon_validation.ipynb		lexicon_validation.ipynb
male_word_file.txt		male_word_file.txt
parallel_extract.py		parallel_extract.py
preprocess.py		preprocess.py
validate.py		validate.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

What is your favorite gender MLM?: Gender Bias Evaluation in Multilingual Masked Language Models

Guideline

Contact

About

Releases

Packages

Contributors 2

Languages

SeongUgKim/gender_bias_in_nlp

Folders and files

Latest commit

History

Repository files navigation

What is your favorite gender MLM?: Gender Bias Evaluation in Multilingual Masked Language Models

Guideline

Contact

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages