I'm pretty sure this is because the webui repo uses the Llama tokenizer, while Mistral uses a different tokenizer. If you load the Mistral tokenizer (e.g. via AutoTokenizer) you should get reasonable output. For example, running interactive_gen.py (our "chat" script) with 4-bit Mistral:
```
Please enter your prompt or 'quit' (without quotes) to quit: Call me Ishmael
Setting `pad_token_id` to `eos_token_id`:2 for open-end generation.
Model Output: Call me Ishmael.
I’m an avid reader of Moby Dick, a book that I read every year or so. It’s one of my favorite books, and the reason for that is simple: Ishmael is my alter ego.
In fact, I
```
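A minimal sketch of the approach described above: loading the model's own tokenizer with `AutoTokenizer` (so Mistral does not get tokenized with the Llama vocabulary) and quantizing to 4 bit. The model id, prompt, and generation settings are illustrative assumptions, not taken from interactive_gen.py.

```python
# Sketch only: assumes transformers + bitsandbytes are installed and a GPU is
# available. The model id below is an assumption for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig


def generate(model_id: str = "mistralai/Mistral-7B-v0.1",
             prompt: str = "Call me Ishmael") -> str:
    # AutoTokenizer resolves the tokenizer class from the model's own config,
    # instead of assuming the Llama tokenizer.
    tok = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(
        model_id,
        quantization_config=BitsAndBytesConfig(load_in_4bit=True),
        device_map="auto",
    )
    inputs = tok(prompt, return_tensors="pt").to(model.device)
    # Passing pad_token_id explicitly silences the warning seen in the
    # transcript above.
    out = model.generate(**inputs, max_new_tokens=64,
                         pad_token_id=tok.eos_token_id)
    return tok.decode(out[0], skip_special_tokens=True)


if __name__ == "__main__":
    print(generate())
```

The same mismatch explains why Llama-2 checkpoints work in the webui while Mistral produces garbage: the webui's hard-coded Llama tokenizer happens to be the right one for Llama-2.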
Testing on the oobabooga webui, as implemented here.
Llama-2 models (13B, 2-bit/4-bit) work as expected.
Tested models:
Typical output: