
Are Qwen1.5 MOE models supported? #6415

Closed
l3utterfly opened this issue Apr 1, 2024 · 6 comments
Comments

@l3utterfly (Contributor)

I tried to convert the Qwen1.5 MoE model: https://huggingface.co/Qwen/Qwen1.5-MoE-A2.7B

It gives me an error message:

 python convert-hf-to-gguf.py /home/layla/src/text-generation-webui/models/Qwen1.5-MoE-A2.7B
Loading model: Qwen1.5-MoE-A2.7B
Traceback (most recent call last):
  File "/home/layla/src/llama.cpp/convert-hf-to-gguf.py", line 2296, in <module>
    main()
  File "/home/layla/src/llama.cpp/convert-hf-to-gguf.py", line 2276, in main
    model_class = Model.from_model_architecture(hparams["architectures"][0])
                  ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/home/layla/src/llama.cpp/convert-hf-to-gguf.py", line 215, in from_model_architecture
    raise NotImplementedError(f'Architecture {arch!r} not supported!') from None
NotImplementedError: Architecture 'Qwen2MoeForCausalLM' not supported!
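The error comes from a lookup that maps the `architectures` string in the model's `config.json` to a registered converter class. A minimal sketch of that registry pattern (hypothetical class names, not llama.cpp's actual implementation):

```python
class Model:
    """Sketch of an architecture-to-converter registry (illustrative only)."""
    _registry = {}  # maps HF architecture name -> converter class

    @classmethod
    def register(cls, *names):
        # Class decorator: associate one or more architecture names
        # with the decorated converter subclass.
        def wrap(subclass):
            for name in names:
                cls._registry[name] = subclass
            return subclass
        return wrap

    @classmethod
    def from_model_architecture(cls, arch):
        # An unregistered architecture fails exactly like the traceback above.
        try:
            return cls._registry[arch]
        except KeyError:
            raise NotImplementedError(f"Architecture {arch!r} not supported!") from None


@Model.register("Qwen2ForCausalLM")
class QwenModel(Model):
    pass
```

Under this scheme, `'Qwen2MoeForCausalLM'` raises `NotImplementedError` simply because no converter class has been registered for that name yet.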

I see that others have uploaded GGUFs of the Qwen MoE models on HF, which leads me to think I'm doing something wrong.
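Before suspecting the conversion step itself, one quick sanity check is to confirm which architecture the downloaded checkpoint actually declares. A small sketch (`model_architectures` is a hypothetical helper, and the `config.json` path depends on where the model was downloaded):

```python
import json


def model_architectures(config_path):
    """Return the 'architectures' list declared in a HF model's config.json."""
    with open(config_path) as f:
        return json.load(f).get("architectures", [])


# Example (path is an assumption; adjust to your local model directory):
# model_architectures("models/Qwen1.5-MoE-A2.7B/config.json")
# If this returns ['Qwen2MoeForCausalLM'], the converter simply has no
# handler for that architecture yet, and the failure is not a local mistake.
```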

@l3utterfly (Contributor, Author)

I see this issue: huggingface/transformers#29377

It adds support for Qwen2. Do I need that PR to convert Qwen1.5 MoE?

@l3utterfly l3utterfly reopened this Apr 1, 2024
@l3utterfly l3utterfly changed the title Are Qwen MOE models supported? Are Qwen1.5 MOE models supported? Apr 1, 2024
@gswsqffsapd3056

Same need here; waiting for this to be resolved.

@Jeximo (Contributor) commented Apr 1, 2024

I see only two people publishing Qwen1.5 MoE GGUFs on Hugging Face (not sure how). llama.cpp supports Qwen, but doesn't yet support Qwen MoE.

Here's a PR discussing converting/running it, though it appears more work is needed before it will run.

@Lyzin commented Apr 3, 2024

I don't know when llama.cpp will support Qwen1.5-MoE; I can only wait for an official update from the Qwen team!

@maziyarpanahi

Speaking of MoE and Qwen, #6453

@github-actions github-actions bot added the stale label May 4, 2024
This issue was closed because it has been inactive for 14 days since being marked as stale.

5 participants