Custom tokenizer does not work with setup script #1695

forestoak777 · 2024-03-10T01:09:33Z

Setting a custom tokenizer in the llm section of settings-local.yaml did not change the value in private_gpt/settings/settings.py, so the setup script ended up with the tokenizer name set to None and crashed while trying to download it. As a temporary fix, I put the literal tokenizer name directly into the tokenizer field of the LLMSettings class in private_gpt/settings/settings.py, where the default was None.

LLM model downloaded!
Downloading tokenizer None

...
(A BUNCH OF PYTHON ERROR STUFF)
...

Repository Not Found for url: https://huggingface.co/None/resolve/main/tokenizer_config.json.
Please make sure you specified the correct `repo_id` and `repo_type`.
If you are trying to access a private or gated repo, make sure you are authenticated.
Invalid username or password.
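
The log shows the literal string `None` ending up in the Hugging Face URL, which is what happens when an unset tokenizer value is formatted into a repo path. Below is a minimal sketch of a guard that fails early with a clearer message. It assumes a hypothetical helper, `resolve_tokenizer_name`, that is not part of the project, and a URL format copied from the error above.

```python
from typing import Optional


def resolve_tokenizer_name(configured: Optional[str]) -> str:
    """Return a usable tokenizer repo id or raise a descriptive error.

    Hypothetical helper for illustration; not taken from the project.
    """
    if configured is None or not configured.strip():
        raise ValueError(
            "llm.tokenizer is not set; add it under 'llm' in your settings file"
        )
    return configured.strip()


def tokenizer_config_url(repo_id: str) -> str:
    # Mirrors the URL shape seen in the error message above.
    return f"https://huggingface.co/{repo_id}/resolve/main/tokenizer_config.json"


# Formatting an unset value reproduces the URL from the log.
broken_url = tokenizer_config_url(str(None))

# With the guard, a configured name produces a well-formed URL instead.
good_url = tokenizer_config_url(
    resolve_tokenizer_name("mistralai/Mistral-7B-Instruct-v0.2")
)
```

Calling `resolve_tokenizer_name(None)` raises a `ValueError` that names the missing setting, rather than letting the download step fail with a "Repository Not Found" error.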

forestoak777 · 2024-03-10T01:10:22Z

If you are looking for a temporary fix to this, see the last sentence in the issue above.

ItsCRC · 2024-03-10T06:28:33Z

I added `tokenizer: mistralai/Mistral-7B-Instruct-v0.2` under `llm` in settings.yaml.
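
This workaround sets the key in the base file rather than in a profile file. As a rough illustration of why the base value matters, the sketch below assumes settings files are layered by a recursive dictionary merge. That is an assumption, since the thread does not show how the project loads its settings. `deep_merge` is a hypothetical helper, and the `mode` key and the `my-org/my-tokenizer` repo id are made up for the example.

```python
from typing import Any, Dict


def deep_merge(base: Dict[str, Any], override: Dict[str, Any]) -> Dict[str, Any]:
    """Recursively layer `override` on top of `base`, returning a new dict."""
    merged = dict(base)
    for key, value in override.items():
        if isinstance(value, dict) and isinstance(merged.get(key), dict):
            merged[key] = deep_merge(merged[key], value)
        else:
            merged[key] = value
    return merged


# Stand-ins for parsed YAML files (contents are illustrative).
base_settings = {
    "llm": {"mode": "local", "tokenizer": "mistralai/Mistral-7B-Instruct-v0.2"}
}
local_settings = {"llm": {"tokenizer": "my-org/my-tokenizer"}}  # hypothetical repo id

# If the profile file is not loaded, the base value is still available.
without_profile = deep_merge(base_settings, {})
# If it is loaded, the profile value wins and sibling keys are preserved.
with_profile = deep_merge(base_settings, local_settings)
```

Under that assumption, a tokenizer defined in the base file gives the setup script something to download even when the profile override is never applied.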

imartinez mentioned this issue Mar 11, 2024

Set default tokenizer to avoid running make setup fail #1709

Merged

imartinez closed this as completed in #1709 Mar 13, 2024
