
KeyError: 'transformer.wte.weight' #4530

Closed
h9-tect opened this issue Dec 19, 2023 · 14 comments

@h9-tect

h9-tect commented Dec 19, 2023

Hello, I'm having this issue while converting the model:

!python llama.cpp/convert.py jais-13b \
  --outfile jais-13b.gguf \
  --outtype q8_0
Loading model file jais-13b/pytorch_model-00001-of-00006.bin
Loading model file jais-13b/pytorch_model-00001-of-00006.bin
Loading model file jais-13b/pytorch_model-00002-of-00006.bin
Loading model file jais-13b/pytorch_model-00003-of-00006.bin
Loading model file jais-13b/pytorch_model-00004-of-00006.bin
Loading model file jais-13b/pytorch_model-00005-of-00006.bin
Loading model file jais-13b/pytorch_model-00006-of-00006.bin
Traceback (most recent call last):
  File "/content/llama.cpp/convert.py", line 1279, in <module>
    main()
  File "/content/llama.cpp/convert.py", line 1207, in main
    model_plus = load_some_model(args.model)
  File "/content/llama.cpp/convert.py", line 1142, in load_some_model
    model_plus = merge_multifile_models(models_plus)
  File "/content/llama.cpp/convert.py", line 635, in merge_multifile_models
    model = merge_sharded([mp.model for mp in models_plus])
  File "/content/llama.cpp/convert.py", line 614, in merge_sharded
    return {name: convert(name) for name in names}
  File "/content/llama.cpp/convert.py", line 614, in <dictcomp>
    return {name: convert(name) for name in names}
  File "/content/llama.cpp/convert.py", line 589, in convert
    lazy_tensors: list[LazyTensor] = [model[name] for model in models]
  File "/content/llama.cpp/convert.py", line 589, in <listcomp>
    lazy_tensors: list[LazyTensor] = [model[name] for model in models]
KeyError: 'transformer.wte.weight'
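For anyone debugging this: the traceback shows that merge_sharded takes the tensor names from one shard and looks each of them up in every shard, so a name that is missing from any shard raises KeyError. A small diagnostic sketch (not part of convert.py; the tiny dicts below are hypothetical stand-ins — with real checkpoints you would build `shards` by torch.load-ing each pytorch_model-*.bin) can show which shard each tensor name actually lives in:

```python
from collections import defaultdict

def index_tensor_names(shards):
    """Map each tensor name to the list of shard indices that contain it."""
    locations = defaultdict(list)
    for i, shard in enumerate(shards):
        for name in shard:
            locations[name].append(i)
    return dict(locations)

# Hypothetical shard contents for illustration only.
shards = [
    {"transformer.wte.weight": None, "transformer.h.0.attn.weight": None},
    {"transformer.h.1.attn.weight": None},  # wte.weight missing here
]
locs = index_tensor_names(shards)
print(locs["transformer.wte.weight"])  # present only in shard 0
```

If a name like transformer.wte.weight maps to fewer shards than expected, the checkpoint layout is not what the converter assumes.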
@dspasyuk
Contributor

@h9-tect Did you figure it out?

@h9-tect
Author

h9-tect commented Dec 25, 2023

@dspasyuk Not yet

@jadechip

Have you tried using convert-hf-to-gguf.py?

@h9-tect
Author

h9-tect commented Dec 28, 2023

@jadechip Yeah, it didn't work.

@LaniakeaS

Got the same problem with StarCoder 15B.

@dz28b

dz28b commented Jan 8, 2024

@h9-tect Any updates?

@h9-tect
Author

h9-tect commented Jan 10, 2024

Nah

@gswsqffsapd3056

Have you tried using convert-hf-to-gguf.py?

The same problem is encountered with llama.cpp/convert.py, but convert-hf-to-gguf.py works.
Model Qwen-72B-Chat.

@LaniakeaS

Interesting update... I tried convert-hf-to-gguf.py to convert starchat-beta and got the following result.

Traceback (most recent call last):
  File "/home/guest/**/llama.cpp/convert-hf-to-gguf.py", line 1173, in <module>
    model_instance.write()
  File "/home/guest/**/llama.cpp/convert-hf-to-gguf.py", line 136, in write
    self.write_tensors()
  File "/home/guest/**/llama.cpp/convert-hf-to-gguf.py", line 97, in write_tensors
    for name, data_torch in self.get_tensors():
  File "/home/guest/**/llama.cpp/convert-hf-to-gguf.py", line 62, in get_tensors
    ctx = contextlib.nullcontext(torch.load(str(self.dir_model / part_name), map_location="cpu", weights_only=True))
  File "/home/guest/miniconda3/envs/code_model/lib/python3.10/site-packages/torch/serialization.py", line 791, in load
    with _open_file_like(f, 'rb') as opened_file:
  File "/home/guest/miniconda3/envs/code_model/lib/python3.10/site-packages/torch/serialization.py", line 271, in _open_file_like
    return _open_file(name_or_buffer, mode)
  File "/home/guest/miniconda3/envs/code_model/lib/python3.10/site-packages/torch/serialization.py", line 252, in __init__
    super().__init__(open(name, mode))
FileNotFoundError: [Errno 2] No such file or directory: 'models/starchat-beta/pytorch_model-00001-of-00005.bin'

But there are only 4 weight files, not 5. I don't know where that fifth one came from...

added_tokens.json	handler.py				   pytorch_model-00004-of-00004.bin  trainer_state.json
all_results.json	merges.txt				   pytorch_model.bin.index.json      training_args.bin
config.json		model-00001-of-00004.safetensors.download  README.md			     train_results.json
dialogue_template.json	model_logo.png				   requirements.txt		     vocab.json
eval_results.json	pytorch_model-00001-of-00004.bin	   special_tokens_map.json
generation_config.json	pytorch_model-00002-of-00004.bin	   tokenizer_config.json
ggml-model-f16.gguf	pytorch_model-00003-of-00004.bin	   tokenizer.json

Btw, starcoder works fine under convert-hf-to-gguf.py

@wanbo432503

wanbo432503 commented Feb 20, 2024

This is because you have another file with the "bin" extension: training_args.bin. The way convert-hf-to-gguf.py counts the number of weight files is simply to count the files matching "*.bin". Renaming training_args.bin to a different suffix will solve the problem.
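The counting pitfall described above can be sketched like this (an illustration of the assumed behaviour, not the literal convert-hf-to-gguf.py code): a bare "*.bin" glob also matches training_args.bin, inflating the expected shard count, while a pattern anchored to the shard naming scheme does not.

```python
import fnmatch

# Hypothetical directory listing mirroring the starchat-beta folder above.
files = [
    "pytorch_model-00001-of-00004.bin",
    "pytorch_model-00002-of-00004.bin",
    "pytorch_model-00003-of-00004.bin",
    "pytorch_model-00004-of-00004.bin",
    "training_args.bin",  # trainer artifact, not a weight shard
]

loose = fnmatch.filter(files, "*.bin")                      # counts 5 files
strict = fnmatch.filter(files, "pytorch_model-*-of-*.bin")  # counts 4 shards
print(len(loose), len(strict))  # → 5 4
```

With the loose count the converter expects pytorch_model-00001-of-00005.bin, which explains the FileNotFoundError above.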


@LaniakeaS

Ah, you are right. It has been solved. But it's still a little bit weird to only consider the suffix instead of the whole file name, right? Does this mean this is a bug that needs to be fixed?

@namehta4

Hi,
I am encountering the same error as OP.
Changing conversion command to
python llama.cpp/convert-hf-to-gguf.py mpt-7b-storywriter --outfile mpt-7b-storywriter.gguf
results in the following error:

Traceback (most recent call last):
  File "/Users/namehta4/Documents/Laptop_Neil/Research/Consulting/ML_tutorial/LLM/llama.cpp/convert-hf-to-gguf.py", line 1876, in <module>
    main()
  File "/Users/namehta4/Documents/Laptop_Neil/Research/Consulting/ML_tutorial/LLM/llama.cpp/convert-hf-to-gguf.py", line 1863, in main
    model_instance.set_vocab()
  File "/Users/namehta4/Documents/Laptop_Neil/Research/Consulting/ML_tutorial/LLM/llama.cpp/convert-hf-to-gguf.py", line 63, in set_vocab
    self._set_vocab_gpt2()
  File "/Users/namehta4/Documents/Laptop_Neil/Research/Consulting/ML_tutorial/LLM/llama.cpp/convert-hf-to-gguf.py", line 304, in _set_vocab_gpt2
    if tokenizer.added_tokens_decoder[i].special:
       ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
AttributeError: 'GPTNeoXTokenizerFast' object has no attribute 'added_tokens_decoder'

Thank you!
Neil
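That AttributeError usually means the installed transformers version predates the added_tokens_decoder property; upgrading transformers is the simplest fix. As a hedged workaround sketch (my assumption, not an official patch), a compatibility shim could fall back to get_added_vocab() on older releases:

```python
def get_special_flags(tokenizer):
    """Return {token_id: is_special} for added tokens, tolerating old
    transformers versions that lack added_tokens_decoder."""
    decoder = getattr(tokenizer, "added_tokens_decoder", None)
    if decoder is not None:
        return {i: tok.special for i, tok in decoder.items()}
    # Fallback for older transformers: get_added_vocab() maps token text
    # to id but carries no special flag, so conservatively mark all as
    # special (an assumption for this sketch).
    return {i: True for i in tokenizer.get_added_vocab().values()}
```

This only sidesteps the crash; the flags may be less precise on the fallback path than what the real property reports.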

@github-actions github-actions bot added the stale label Mar 22, 2024
Contributor

github-actions bot commented Apr 6, 2024

This issue was closed because it has been inactive for 14 days since being marked as stale.

@github-actions github-actions bot closed this as completed Apr 6, 2024
@lipingtang17

Hi there. Is there any update on this issue? I am using the JAIS model and hitting the same error.
