
customise number of experts in mixtral #1553

Open
scienlabs opened this issue Dec 15, 2023 · 4 comments

Comments

@scienlabs

Could someone provide guidance or documentation on how to adjust the number of experts in Mixtral? I'm particularly interested in understanding whether there's a way to dynamically adjust this number based on the requirements of different tasks or scenarios.

@scienlabs scienlabs changed the title cutomise number of experts in mixtral customise number of experts in mixtral Dec 15, 2023
@RafaAguilar

I'm not sure what Ollama uses, but for the llama.cpp backend you can override a key in the model's metadata with:

--override-kv KEY=TYPE:VALUE
                        advanced option to override model metadata by key. may be specified multiple times.
                        types: int, float, bool. example: --override-kv tokenizer.ggml.add_bos_token=bool:false

For example, I override it using:

--override-kv llama.expert_used_count=int:3

But I think this is not yet supported by Ollama's Modelfile.
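For context on what `llama.expert_used_count` controls: Mixtral is a mixture-of-experts model in which a router sends each token to a fixed number of experts (2 of 8 by default), and the override above changes that per-token count. A minimal NumPy sketch of that top-k routing step (an illustration, not llama.cpp's actual implementation):

```python
import numpy as np

def route_tokens(gate_logits: np.ndarray, expert_used_count: int):
    """Pick the top-k experts per token and renormalize their gate weights.

    gate_logits: (n_tokens, n_experts) router outputs.
    expert_used_count: how many experts each token is sent to (Mixtral default: 2).
    """
    # Indices of the k largest logits per token (order within the top-k is unspecified).
    topk = np.argpartition(gate_logits, -expert_used_count, axis=-1)[:, -expert_used_count:]
    # Softmax over only the selected experts' logits, so the kept weights sum to 1.
    selected = np.take_along_axis(gate_logits, topk, axis=-1)
    weights = np.exp(selected - selected.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return topk, weights

# One token, 8 experts (Mixtral's layout); route to 3 as in the override above.
logits = np.array([[0.1, 2.0, -1.0, 0.5, 3.0, 0.0, 1.5, -0.5]])
experts, w = route_tokens(logits, expert_used_count=3)
```

Raising `expert_used_count` makes each token pass through more expert FFNs (more compute, potentially different quality); it does not change the model's weights.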

@scienlabs (Author)

How can I do it with Ollama? Wondering if anyone can help.

@PLK2

PLK2 commented May 8, 2024

Figured it out yet?

@ColumbusAI

Any update on this?

4 participants