Update

shintaro-ozaki · Jun 26, 2024 · b68f445 · b68f445
1 parent f3429d1
commit b68f445
Showing 1 changed file with 18 additions and 0 deletions.
diff --git a/README.md b/README.md
@@ -804,3 +804,21 @@ Turbo can generate questions with adequate general knowledge in both languages,
 albeit not as culturally 'deep' as humans. We also observe a higher occurrence
 of fluency errors in the Sundanese dataset, highlighting the discrepancy
 between medium- and lower-resource languages.
+<br>http://arxiv.org/abs/2402.17302v2
+Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese
+Large Language Models (LLMs) are increasingly being used to generate
+synthetic data for training and evaluating models. However, it is unclear
+whether they can generate a good quality of question answering (QA) dataset
+that incorporates knowledge and cultural nuance embedded in a language,
+especially for low-resource languages. In this study, we investigate the
+effectiveness of using LLMs in generating culturally relevant commonsense QA
+datasets for Indonesian and Sundanese languages. To do so, we create datasets
+for these languages using various methods involving both LLMs and human
+annotators, resulting in ~4.5K questions per language (~9K in total), making
+our dataset the largest of its kind. Our experiments show that automatic data
+adaptation from an existing English dataset is less effective for Sundanese.
+Interestingly, using the direct generation method on the target language, GPT-4
+Turbo can generate questions with adequate general knowledge in both languages,
+albeit not as culturally 'deep' as humans. We also observe a higher occurrence
+of fluency errors in the Sundanese dataset, highlighting the discrepancy
+between medium- and lower-resource languages.