NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification
This repository contains the code for training movie review sentiment classifiers, together with the NollySenti dataset for Nigerian languages.
The code is based on the Hugging Face implementation (License: Apache 2.0).
The data is licensed under CC-BY-4.0.
- python
- transformers : state-of-the-art Natural Language Processing for TensorFlow 2.0 and PyTorch.
- scikit-learn (imported as sklearn) : for F1-score evaluation
- ptvsd : remote debugging server for Python support in Visual Studio and Visual Studio Code.
pip install transformers scikit-learn ptvsd
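The F1-score evaluation mentioned in the requirements above can be sketched as follows. This is a minimal illustration rather than the repository's evaluation script: the labels and predictions are made up, the 0/1 label encoding is hypothetical, and macro averaging is an assumption (check the training script for the averaging actually used).

```python
from sklearn.metrics import f1_score

# Hypothetical gold labels and model predictions for six reviews
# (assumed encoding: 1 = positive, 0 = negative).
y_true = [1, 0, 1, 1, 0, 0]
y_pred = [1, 0, 0, 1, 0, 1]

# Macro averaging computes F1 per class and takes the unweighted mean.
# The averaging mode is an assumption, not taken from the repository.
macro_f1 = f1_score(y_true, y_pred, average="macro")
print(f"macro F1: {macro_f1:.3f}")
```

Passing `average="weighted"` or `average="micro"` instead changes how the per-class scores are combined, which matters when the sentiment classes are imbalanced.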
If you make use of this dataset, please cite us:
@misc{shode2023nollysenti,
title={NollySenti: Leveraging Transfer Learning and Machine Translation for Nigerian Movie Sentiment Classification},
author={Iyanuoluwa Shode and David Ifeoluwa Adelani and Jing Peng and Anna Feldman},
year={2023},
eprint={2305.10971},
archivePrefix={arXiv},
primaryClass={cs.CL}
}