Supporting LmHead and Embedding Layers for Adapters #231

magdyksaleh · 2024-02-08T20:59:32Z

System Info

Adapters don't work if you make changes to the vocab.

Information

Docker
The CLI directly

Tasks

An officially supported command
My own modifications

Reproduction

to come

Expected behavior

to come

tgaddair · 2024-02-17T00:43:47Z

Context: https://stackoverflow.com/questions/72775559/resize-token-embeddings-on-the-a-pertrained-model-with-different-embedding-size

arnavgarg1 · 2024-02-20T18:58:58Z

Here's a code block also demonstrating what might be needed:

>>> from transformers import AutoTokenizer, AutoModelForCausalLM
>>> model = AutoModelForCausalLM.from_pretrained("yujiepan/llama-2-tiny-random")
>>> tokenizer = AutoTokenizer.from_pretrained("yujiepan/llama-2-tiny-random")
>>> model.get_input_embeddings()
Embedding(32000, 8, padding_idx=0)
>>> len(tokenizer.vocab)
32000
>>> tokenizer.add_tokens(['|INST|'])
1
>>> len(tokenizer.vocab)
32001
>>> model.resize_token_embeddings(len(tokenizer.vocab))
Embedding(32001, 8)
>>> model.get_input_embeddings().padding_idx = 0  # resizing dropped padding_idx: save it before and set it again after
>>> model.get_input_embeddings()
Embedding(32001, 8, padding_idx=0)
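
The session above hardcodes `padding_idx = 0` after resizing. A minimal sketch of the "save before, set again after" pattern could look like the following. The helper name `resize_embeddings_keep_padding_idx` is hypothetical and not part of `transformers`; it only assumes the model exposes `get_input_embeddings()` and `resize_token_embeddings()`, as in the session above.

```python
def resize_embeddings_keep_padding_idx(model, new_num_tokens):
    """Resize the token embeddings and carry the old padding_idx over.

    Hypothetical helper: `model` is assumed to expose
    get_input_embeddings() and resize_token_embeddings(), as the
    Hugging Face model in the session above does.
    """
    # Save padding_idx before resizing, since the resized layer may not keep it.
    old_padding_idx = model.get_input_embeddings().padding_idx

    model.resize_token_embeddings(new_num_tokens)

    # Fetch the (possibly new) embedding module and restore the saved value.
    embeddings = model.get_input_embeddings()
    if old_padding_idx is not None and old_padding_idx < new_num_tokens:
        embeddings.padding_idx = old_padding_idx
    return embeddings
```

With the objects from the session above, this would be called as `resize_embeddings_keep_padding_idx(model, len(tokenizer))` after `tokenizer.add_tokens(...)`. It only restores the attribute; it does not address how adapters should handle the resized embedding or LM head weights.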

tgaddair added the enhancement (New feature or request) label Feb 8, 2024

tgaddair mentioned this issue Mar 4, 2024

mixtral adapters returning broadcast shape error #301

Open

2 tasks

tgaddair assigned geoffreyangus Mar 4, 2024

tgaddair mentioned this issue Mar 17, 2024

Project Roadmap #57

Open

36 tasks

tgaddair assigned ajtejankar May 7, 2024

ajtejankar linked a pull request Jun 8, 2024 that will close this issue

(WIP) Support targeting the embedding layer for LoRA #501

Open
