MultiLingual Quality Estimation (MLQE) Dataset

This repository contains data for the 2020 Quality Estimation Shared Task:
http://www.statmt.org/wmt20/quality-estimation-task.html

Training and development data

Check the 'data' folder

NMT models

Check the 'nmt-models' folder

Parallel data used to train the NMT models

Check 'http://www.statmt.org/wmt20/quality-estimation-task.html'

Citation

If you use this data in your work, please cite:

@article{tacl2020,
    title = {Unsupervised Quality Estimation for Neural Machine Translation},
    author = {Fomicheva, Marina and Sun, Shuo and Yankovskaya, Lisa and Blain, Frédéric and Guzmán, Francisco and Fishel, Mark and Aletras, Nikolaos and Chaudhary, Vishrav and Specia, Lucia},
    journal = {Transactions of the Association for Computational Linguistics},
    volume = {8},
    pages = {539-555},
    year = {2020}
}

Changelog

2020-03-15: Adding details about training data for NMT models
2020-03-19: Releasing dataset

License

The dataset is licensed under CC-BY-SA, see the LICENSE file for details.

Name		Name	Last commit message	Last commit date
Latest commit History 35 Commits
data-multi-hyp		data-multi-hyp
data		data
nmt_models		nmt_models
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MultiLingual Quality Estimation (MLQE) Dataset

Training and development data

NMT models

Parallel data used to train the NMT models

German-English

Chinese-English

Romanian-English

Estonian-English

Sinhala-English

Nepali-English

Citation

Changelog

License

About

Releases

Packages

Contributors 3

License

facebookresearch/mlqe

Folders and files

Latest commit

History

Repository files navigation

MultiLingual Quality Estimation (MLQE) Dataset

Training and development data

NMT models

Parallel data used to train the NMT models

German-English

Chinese-English

Romanian-English

Estonian-English

Sinhala-English

Nepali-English

Citation

Changelog

License

About

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

Packages 0

Contributors 3

Packages