Cross-lingual Editing in Large Language Models

Setup

Environment

This codebase uses Python 3.7.9. Other versions may work as well.

Create a virtualenv (pyenv can help with this) and install the dependencies:

$ python -m venv env
$ source env/bin/activate
(env) $ pip install -r requirements.txt
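
Before installing the dependencies, it can help to confirm which interpreter the virtualenv picked up. The snippet below is a minimal sketch, not part of this repository: the helper name `version_matches` and the `EXPECTED` constant are hypothetical, and the check only compares against the 3.7 series named above.

```python
import sys

# Version series named in this README; other versions may work as well.
EXPECTED = (3, 7)  # hypothetical constant, not defined anywhere in the repo


def version_matches(version_info=sys.version_info, expected=EXPECTED):
    """Return True when the interpreter's major.minor equals `expected`."""
    return tuple(version_info[:2]) == tuple(expected)


if __name__ == "__main__":
    found = "{}.{}.{}".format(*sys.version_info[:3])
    if version_matches():
        print("Python", found, "is in the 3.7 series this codebase uses.")
    else:
        print("Python", found, "differs from 3.7; other versions may work as well.")
```

Run it inside the activated environment so that it reports the virtualenv's interpreter rather than the system one.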

Data

Download the data needed for this project from this Google Drive link, then update the dataset path in run.py so that it points to the location where you saved the dataset.
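
A wrong dataset path usually surfaces only after a run has started, so it may be worth validating the location first. The following is a minimal sketch under stated assumptions: `resolve_dataset_path` is a hypothetical helper, not a function from this repository, and the way run.py actually stores its dataset path is not shown here.

```python
from pathlib import Path


def resolve_dataset_path(raw_path):
    """Expand and normalize a dataset path, failing early if it is missing.

    Hypothetical helper for illustration; run.py may handle paths differently.
    """
    path = Path(raw_path).expanduser().resolve()
    if not path.exists():
        raise FileNotFoundError("Dataset not found at {}".format(path))
    return path
```

Calling the helper with the folder you downloaded returns an absolute path that can then be copied into run.py.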

Citation

@inproceedings{beniwal-etal-2024-cross,
    title = "Cross-lingual Editing in Multilingual Language Models",
    author = "Beniwal, Himanshu  and
      D, Kowsik  and
      Singh, Mayank",
    editor = "Graham, Yvette  and
      Purver, Matthew",
    booktitle = "Findings of the Association for Computational Linguistics: EACL 2024",
    month = mar,
    year = "2024",
    address = "St. Julian{'}s, Malta",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.findings-eacl.140",
    pages = "2078--2128",
    abstract = "The training of large language models (LLMs) necessitates substantial data and computational resources, and updating outdated LLMs entails significant efforts and resources. While numerous model editing techniques (METs) have emerged to efficiently update model outputs without retraining, their effectiveness in multilingual LLMs, where knowledge is stored in diverse languages, remains an underexplored research area. This research paper introduces the cross-lingual model editing (XME) paradigm, wherein a fact is edited in one language, and the subsequent update propagation is observed across other languages. To investigate the XME paradigm, we conducted experiments using BLOOM, mBERT, and XLM-RoBERTa using the two writing scripts: Latin (English, French, and Spanish) and Indic (Hindi, Gujarati, and Bengali). The results reveal notable performance limitations of state-of-the-art METs under the XME setting, mainly when the languages involved belong to two distinct script families. These findings highlight the need for further research and development of XME techniques to address these challenges. For more comprehensive information, the dataset used in this research and the associated code are publicly available at the following [URL](https://github.com/lingo-iitgn/XME).",
}

