Split pragmatics into presuppositions and scalar implicatures #2938

raileymontalan · 2024-08-16T09:13:38Z

No description provided.

raileymontalan · 2024-09-06T14:45:39Z

Hi @weiqipedia, for your info.

yifanmai

Looks good overall. Note that you have to change schema_bhasa.yaml to reflect changes (but that can be done in a separate pull request).

src/helm/benchmark/scenarios/bhasa_scenario.py

yifanmai · 2024-09-09T22:33:36Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                    instruction=instruction.format(row["choices_translated"]),
+                )
+                # Split "True or False" into ["True", "or", "False"]
+                choices = row["choices"].split()


Optional: For English, you can do row["choices"].split(" or ")

src/helm/benchmark/scenarios/bhasa_scenario.py

yifanmai · 2024-09-09T22:35:01Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                )
+                # Split "True or False" into ["True", "or", "False"]
+                choices = row["choices"].split()
+                choices_translated = row["choices_translated"].split()


Does this work consistently across every (supported) language?

That's a good question! For now we only have Indonesian (and Tamil), and this splitting and taking the first and third index of the list does work for both languages. But just FYI, this will not work for Thai because of the lack of spaces, and we'll have to use something more similar to your suggestion of " or " (but we will not be having Thai any time soon)

src/helm/benchmark/scenarios/bhasa_scenario.py

yifanmai · 2024-09-24T20:37:23Z

run_eval.sh

+
+export HF_HOME=/mnt/fs-arf-01/railey4/cache
+export HF_DATASETS_CACHE=/mnt/fs-arf-01/railey4/cache
+export HF_TOKEN=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


Careful with exposing secrets to the public. You should invalidate this token and avoid adding other tokens to the pull request.

yifanmai · 2024-09-24T20:38:46Z

run_eval.sh

If you'd like to add bash scripts to the git, could you:

put this in the scripts/bhasa or scripts/aisingapore folder and

add comments to the script that explains the purpose of the script?

src/helm/benchmark/run_specs/bhasa_run_specs.py

yifanmai · 2024-09-24T20:52:51Z

src/helm/benchmark/run_specs/bhasa_run_specs.py

@@ -606,14 +607,14 @@ def get_lindsea_pragmatics_pragmatic_reasoning_single_spec(language="id") -> Run
        scenario_spec=scenario_spec,
        adapter_spec=adapter_spec,
        metric_specs=get_exact_match_metric_specs(),
-        groups=["bhasa_linguistic", f"lindsea_pragmatics_pragmatic_reasoning_single_{language}"],
+        groups=["bhasa_linguistic", f"lindsea_pragmatics_presuppositions_{subset}_{language}"],


at least one of these strings has to match the group name in schema_bhasa.yaml, which is currently "lindsea_pragmatics_presuppositions_id". I'd suggest doing:

groups=["bhasa_linguistic", f"lindsea_pragmatics_presuppositions_{language}", f"lindsea_pragmatics_presuppositions_{subset}_{language}"],

yifanmai · 2024-09-24T20:57:47Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+        if self.language not in self.prompts.keys():
+            raise (Exception(f"Unsupported language {self.language} - supported languages are {self.prompts.keys()}"))
+        else:
+            self.prompt_components = self.prompts[self.language]

    def download_dataset(self, output_path: str):
        BASE_URL = "https://raw.githubusercontent.com/aisingapore/BHASA/main/lindsea/"


Optional: You can pin this to a specific commit githash so that future changes to the git won't cause this scenario to change. e.g.

BASE_URL = "https://raw.githubusercontent.com/aisingapore/BHASA/10e34008e8142bef400cf8ffab15b2b6aaf3aa7f/lindsea/"

yifanmai · 2024-09-24T20:59:56Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+        if self.language not in self.prompts.keys():
+            raise (Exception(f"Unsupported language {self.language} - supported languages are {self.prompts.keys()}"))
+        else:
+            self.prompt_componets = self.prompts[self.language]


prompt_componets is misspelled - it should be prompt_components

yifanmai · 2024-09-24T21:03:45Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                question = self.prompt_components["single_question"]
+                instruction = self.prompt_components["single_instruction"]
+
+                passage = "{question}\nPernyataan: {text}\n{instruction}".format(


Move Pernyataan into prompt components?

yifanmai · 2024-09-24T21:07:36Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                instruction = self.prompt_components["pair_instruction"]
+                label = self.prompt_components[str(row["label"])]
+
+                passage = "Situasi: {premise}\n{question}\nPernyataan: {conclusion}\n{instruction}".format(


Move Situasi into prompt components.

yifanmai · 2024-09-24T21:07:47Z

src/helm/benchmark/scenarios/bhasa_scenario.py

+                question = self.prompt_components["single_question"]
+                instruction = self.prompt_components["single_instruction"]
+
+                passage = "{question}\nPernyataan: {text}\n{instruction}".format(


Move Pernyataan into prompt components.

yifanmai · 2024-09-24T21:10:48Z

src/helm/benchmark/scenarios/bhasa_scenario.py

@@ -171,7 +171,7 @@ def __init__(self, language: str):
        super().__init__()
        self.language = language
        self.splits = {"train": TRAIN_SPLIT, "test": TEST_SPLIT}
-        self.map = {
+        self.prompts = {


Rename to self.language_to_prompt_components.

Same below.

raileymontalan added 4 commits July 1, 2024 02:54

Add LINDSEA pragmatics subset

999bdec

Split pragmatics

fdad90d

Split pragmatics into pressupositions and scalar implicatures

44b9a04

Update run entries for pragmatics

e903728

raileymontalan marked this pull request as draft August 16, 2024 09:14

raileymontalan added 9 commits August 17, 2024 02:48

Fix formatting

2e70dfe

Rerun unit tests

a27b9e8

Merge branch 'stanford-crfm:main' into lindsea_pragmatics_scenario_split

de472e6

Enforce input typing

14b31f5

Enforce input typing

4b14707

Remove line

8648ac5

Fix type checks

7ac4866

Fix line error

8e84dd2

Simplify dict

6bad749

raileymontalan marked this pull request as ready for review September 6, 2024 14:44

yifanmai reviewed Sep 9, 2024

View reviewed changes

raileymontalan added 4 commits September 12, 2024 03:45

Update BHASA schema, add exception for unsupported languages for LINDSEA

86ad100

Add file extension for downloaded files

b8020e3

Fix naming convention

13400af

Add error raising for unsupported langauges

3d88380

raileymontalan requested a review from yifanmai September 23, 2024 05:22

Add run_eval

0ab8fc3

yifanmai requested changes Sep 24, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Split pragmatics into presuppositions and scalar implicatures #2938

Split pragmatics into presuppositions and scalar implicatures #2938

raileymontalan commented Aug 16, 2024

raileymontalan commented Sep 6, 2024

yifanmai left a comment

yifanmai Sep 9, 2024

yifanmai Sep 9, 2024

weiqipedia Sep 10, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

yifanmai Sep 24, 2024

Split pragmatics into presuppositions and scalar implicatures #2938

Are you sure you want to change the base?

Split pragmatics into presuppositions and scalar implicatures #2938

Conversation

raileymontalan commented Aug 16, 2024

raileymontalan commented Sep 6, 2024

yifanmai left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment