Does this version of few-shot learning make sense? : LocalLLaMA #551

Open
irthomasthomas opened this issue Feb 18, 2024 · 0 comments
Labels
- `AI-Agents`: Autonomous AI agents using LLMs
- `AI-Chatbots`: Topics related to advanced chatbot platforms integrating multiple AI models
- `Algorithms`: Sorting, Learning or Classifying. All algorithms go here.
- `finetuning`: Tools for finetuning of LLMs e.g. SFT or RLHF
- `in-context-learning`: Examples of few-shot prompts for in-context learning.
- `llm`: Large Language Models
- `llm-experiments`: experiments with large language models

Comments

@irthomasthomas
Owner

TITLE: Does this version of few-shot learning make sense? : LocalLLaMA

DESCRIPTION: I know I'm supposed to give examples when teaching an LLM a new task. I've always seen it done in the first prompt:

> Help me perform sentiment analysis on some reviews. Here are a few examples:
> "This movie rocks!" - Positive
> "This movie sucks!" - Negative
> "The movie was meh" - Neutral

My question is whether there are cases where the examples work better if they are split into separate messages, alternating between roles:

```python
[{'role': 'system', 'content': 'Help me perform sentiment analysis on some reviews'},
 {'role': 'user', 'content': 'This movie rocks!'},
 {'role': 'assistant', 'content': 'Positive'},
 {'role': 'user', 'content': 'This movie sucks!'},
 {'role': 'assistant', 'content': 'Negative'},
 {'role': 'user', 'content': 'This movie is meh'},
 {'role': 'assistant', 'content': 'Neutral'}]
```

Or are those the same thing? Or are there cases where one is better than the other?
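The second, alternating-role format from the question can be built programmatically. Below is a minimal sketch, assuming a helper function of our own naming (`build_fewshot_messages` is hypothetical, not from any library); the actual model/API call is omitted, only message construction is shown.

```python
# Hypothetical helper: turn labelled examples into the alternating-role
# message format described in the question. Each (input, label) pair
# becomes a user turn followed by an assistant turn.

def build_fewshot_messages(system_prompt, examples, query):
    """Build a chat message list: system prompt, one user/assistant
    turn per example, then the real query as the final user turn."""
    messages = [{"role": "system", "content": system_prompt}]
    for text, label in examples:
        messages.append({"role": "user", "content": text})
        messages.append({"role": "assistant", "content": label})
    messages.append({"role": "user", "content": query})
    return messages

examples = [
    ("This movie rocks!", "Positive"),
    ("This movie sucks!", "Negative"),
    ("This movie is meh", "Neutral"),
]
messages = build_fewshot_messages(
    "Help me perform sentiment analysis on some reviews",
    examples,
    "I laughed the whole way through!",
)
print(len(messages))  # 1 system + 3 example pairs (6 messages) + 1 query = 8
```

A list shaped like this can be passed directly as the `messages` argument of most chat-completion-style APIs.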


[–]phree_radical 3 points 21 hours ago

The second one is few-shot: it follows a pattern and targets in-context learning, which is something LLMs learn in pre-training. The first, on the other hand, relies on the instruct training and assumes the fine-tuning taught the model how to fish examples out of instructions. It may leverage the in-context learning ability to some extent, while also working against it.

When following a pattern like your second example, you will find you can remove the instructions entirely. If you leave them in or change them, you can observe that the pattern-following overrides the instructions. This might seem unreasonable, but it irks me that OpenAI pushed the terminology to say "few-shot" when you provide examples inside instructions: that isn't taking advantage of in-context task learning, it's relying on facets of the fine-tuned task.

[–]Revolutionary_Ad6574[S] 1 point 21 hours ago

Thank you for the detailed response, but could you give me the ELI5 version?

[–]phree_radical 2 points 21 hours ago

The first one is following instructions, which might not be strong; the chatbot might not know how to follow your instruction with examples. I don't think people should call it few-shot. The second one follows a pattern, which is strong. So strong that there's really no point in including instructions (like your system prompt), since the strong pattern-following will just overpower them.
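The instruction-free, pattern-only approach the commenter describes can be sketched as a plain completion prompt: the repeating shape of the text alone tells the model what to do. The `Review:`/`Sentiment:` template and the helper name below are illustrative, not from the thread; the completion-endpoint call itself is omitted.

```python
# Minimal sketch of an instruction-free few-shot prompt. The repeated
# pattern carries the task; no system prompt or instruction is included.

def build_pattern_prompt(examples, query):
    """Render (input, label) pairs as a repeating text pattern and leave
    the final label blank for the model to complete."""
    lines = []
    for text, label in examples:
        lines.append(f"Review: {text}")
        lines.append(f"Sentiment: {label}")
    lines.append(f"Review: {query}")
    lines.append("Sentiment:")  # the model completes this line
    return "\n".join(lines)

prompt = build_pattern_prompt(
    [("This movie rocks!", "Positive"), ("This movie sucks!", "Negative")],
    "This movie is meh",
)
print(prompt)
```

This kind of string is what a base (non-instruct) model would be given at a completion endpoint, which is where in-context pattern-following was originally demonstrated.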

[–]Revolutionary_Ad6574[S] 1 point 21 hours ago

So whenever I have time to set it up, it's always better to use the second one?

[–]phree_radical 2 points 21 hours ago*

I'm pretty sure, yeah. If you want an LLM to follow examples, use the second pattern. An exception might be if there's something about the examples you don't want the model to imitate, like placeholder text.

URL: https://old.reddit.com/r/LocalLLaMA/comments/1at0zat/does_this_version_of_fewshot_learning_make_sense/

Suggested labels

{'label-name': 'instruction-based-learning', 'label-description': 'Discussion on teaching AI systems through following instructions and examples sequentially.', 'gh-repo': 'content-labels', 'confidence': 67.09}

@irthomasthomas added the labels `AI-Agents`, `AI-Chatbots`, `Algorithms`, `in-context-learning`, `llm`, `llm-experiments`, and `finetuning`, and removed the `New-Label` label, on Feb 18, 2024.