Add tokenizer config path argument #2110
Thanks for this suggestion! This PR looks good to me. Note that we have another PR #1996 that removes the |
Oh, I have not looked at PR #1996, it seems much more natural. Also I found after submitting this PR that PR #1761 was working somehow by chance I think. The custom config passed to
Also the I was struggling with this and about to ask why this behavior is so strange, why are the flags Anyway, I will test PR #1996 and see if it works for me.
Hi @JosselinSomervilleRoberts, I think that the PR #1996 approach is simpler and less verbose. Sadly, I was not able to execute the
Anyway, PR #1996 will still suffer from the behavior that I commented on earlier. If someone does the following:

    register_tokenizer_configs_from_path(some_path)  # to register a custom config on `some_path` (for any reason)
    register_helm_configurations()  # to load the rest of the configuration files

In this case the tokenizer config on
@RawthiL I just merged #2142 which also addresses this issue. Could you try it out and see if it works for you? Otherwise, feel free to add the |
For the record, you'll need to run from main to get the fixes i.e. I'm planning on releasing v0.4.0 tomorrow, which will also have the fixes. |
@yifanmai I have installed the current main and it seems to be working perfectly! For anyone stepping into this, now you can pass the configuration path to the run command using the

    helm-run \
        --conf-paths /home/helm/config/test_helm.conf \
        --suite test-2 \
        --local-path /home/helm/config

where
Now (e3cf155) the behavior is as expected, the config is loaded only inside the main function and overridden by the one in the given path:
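A minimal sketch of that load order, assuming a simple name-keyed registry in which a later registration replaces an earlier one. The names `register_configs`, `main`, `example/tokenizer` and the `base_url` key are hypothetical and are not HELM's actual API; the sketch only illustrates "defaults first, then the config from the given path":

```python
from typing import Dict

# Hypothetical in-memory registry: tokenizer name -> config dict.
_REGISTRY: Dict[str, dict] = {}


def register_configs(configs: Dict[str, dict]) -> None:
    """Register configs; an entry with an existing name replaces the old one."""
    _REGISTRY.update(configs)


def main(default_configs: Dict[str, dict], custom_configs: Dict[str, dict]) -> Dict[str, dict]:
    """Load configs only here: defaults first, then the user-supplied ones."""
    _REGISTRY.clear()
    register_configs(default_configs)   # built-in configuration
    register_configs(custom_configs)    # config from the given path wins
    return dict(_REGISTRY)


# Illustrative data only; the names and URLs are made up.
defaults = {"example/tokenizer": {"base_url": "http://default.invalid"}}
custom = {"example/tokenizer": {"base_url": "http://localhost:8080"}}
result = main(defaults, custom)
```

Because registration happens only inside `main` and in a fixed order, the outcome no longer depends on which helper a caller happened to invoke first.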
I will close this as the current method is better.
The possibility of modifying two out of the tree config files was added in PR 1761; however, the tokenizer configuration was left out (not sure why). I believe that we should enable the user to select a custom `tokenizer_configs.yaml` file by means of a new parameter:

In my particular case this is required to enable the separation of the tokenizer endpoint from the model endpoint. With this PR, the user will be able to do the following:
1- Set a tokenizer and model deployment config:
2- Assign different urls to each endpoint:
(model_deployments.yaml)
(tokenizer_configs.yaml)
Note: The other way of changing the URL is through the `HELM_HTTP_MODEL_BASE_URL` environment variable, but this changes both the model and the tokenizer URL, which is not what I required.
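The difference between the two approaches can be sketched as follows. This is an illustration under assumptions, not HELM's implementation: the helper functions and the `base_url` key are hypothetical, and only the environment variable name comes from the note above.

```python
from typing import Dict, Mapping

ENV_VAR = "HELM_HTTP_MODEL_BASE_URL"  # name taken from the note above


def urls_from_env(env: Mapping[str, str]) -> Dict[str, str]:
    """One shared variable: model and tokenizer necessarily get the same URL."""
    base = env[ENV_VAR]
    return {"model": base, "tokenizer": base}


def urls_from_configs(model_deployment: dict, tokenizer_config: dict) -> Dict[str, str]:
    """Separate config files: each endpoint carries its own URL.

    The `base_url` key is a hypothetical field name used for illustration.
    """
    return {
        "model": model_deployment["base_url"],
        "tokenizer": tokenizer_config["base_url"],
    }


# Illustrative values only.
shared = urls_from_env({ENV_VAR: "http://localhost:8080"})
split = urls_from_configs(
    {"base_url": "http://localhost:8080"},   # from model_deployments.yaml
    {"base_url": "http://localhost:9090"},   # from tokenizer_configs.yaml
)
```

With the shared variable the two endpoints cannot diverge, which is why a separate tokenizer config path is needed for this use case.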