T5 AdapterDrop Prefix-Tuning Bug #669

Closed
FahadEbrahim opened this issue Apr 7, 2024 · 2 comments · Fixed by #673

FahadEbrahim commented Apr 7, 2024

Hi,

I'm trying to train with AdapterDrop on T5. It works fine for bottleneck adapters, LoRA, and IA3, but it does not work with Prefix Tuning (and therefore not with MAM or UniPELT, which include it).

Here is a notebook that reproduces the issue:
https://gist.github.com/FahadEbrahim/66686814f02978da9d4376470356647d

The error is:
RuntimeError: The size of tensor a (90) must match the size of tensor b (80) at non-singleton dimension 3.
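
For context, a stripped-down sketch of the kind of setup involved (the actual code is in the notebook above; the model name, adapter name, and the `skip_layers`-based AdapterDrop pattern borrowed from the AdapterDrop training example are only illustrative):

```python
import numpy as np
from transformers import TrainerCallback
from adapters import AutoAdapterModel, PrefixTuningConfig

# Illustrative setup only; head and Trainer configuration omitted.
model = AutoAdapterModel.from_pretrained("t5-small")
model.add_adapter("prefix", config=PrefixTuningConfig())
model.train_adapter("prefix")

# AdapterDrop: randomly skip the adapter in the first n layers on each training step.
class AdapterDropCallback(TrainerCallback):
    def on_step_begin(self, args, state, control, **kwargs):
        skip_layers = list(range(np.random.randint(0, 6)))
        kwargs["model"].set_active_adapters("prefix", skip_layers=skip_layers)

    def on_evaluate(self, args, state, control, **kwargs):
        # Disable layer skipping during evaluation so results are comparable across epochs.
        kwargs["model"].set_active_adapters("prefix", skip_layers=None)
```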

@FahadEbrahim FahadEbrahim added the bug Something isn't working label Apr 7, 2024
@FahadEbrahim FahadEbrahim changed the title T5 AdapterDrop Bugs T5 AdapterDrop Prefix-Tuning Bug Apr 8, 2024
@TimoImhof TimoImhof self-assigned this Apr 10, 2024
TimoImhof (Contributor) commented:

Hi @FahadEbrahim,

The bug happens only for Prefix Tuning and not for the other adapter methods because Prefix Tuning changes the input dimensions when it prepends the prefixes. When adapter layers are then dropped during training with AdapterDrop, the individual transformer layers of T5 can have different input dimensions. This leads to the runtime error, because T5 always forwards the positional encoding (position bias) to the next layer under the assumption that the dimensions never change.
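
To make the mismatch concrete, here is a minimal, self-contained illustration of the shape clash (the sizes are illustrative and this is not the library's actual attention code):

```python
import torch

batch, heads, query_len, prefix_len = 4, 8, 80, 10

# A layer where the prefix-tuning adapter was dropped computes its position bias
# over the original 80 key positions and forwards it to the next layer.
position_bias = torch.zeros(batch, heads, query_len, query_len)

# The next layer still applies prefix tuning, so its keys are extended by 10
# prefix vectors and its attention scores cover 90 key positions.
scores = torch.zeros(batch, heads, query_len, query_len + prefix_len)

try:
    scores = scores + position_bias
except RuntimeError as e:
    # The size of tensor a (90) must match the size of tensor b (80)
    # at non-singleton dimension 3
    print(e)
```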

Thanks for bringing this up! With the new PR, this problem should be solved; your script is running fine with the fix on my machine.

FahadEbrahim (Contributor, Author) commented:

@TimoImhof Thank you for your quick response and feedback. I tested the new PR branch and it's working perfectly.

With my appreciation,
Fahad.

calpt pushed a commit that referenced this issue Apr 12, 2024
Fixes #669 

Changes in this PR:
- Avoid throwing `RuntimeError` due to the dimension mismatch occurring when passing the positional encoding from layers dropped by AdapterDrop to layers modified by prefix tuning.
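
Conceptually, the guard amounts to not reusing a position bias forwarded from a previous layer once its key dimension no longer matches the current layer; a rough sketch of that idea (hypothetical helper, not the actual code of this PR):

```python
import torch

def maybe_reset_position_bias(position_bias, key_length):
    """Hypothetical helper: drop a position bias forwarded from a previous layer
    if its key dimension no longer matches, forcing the layer to recompute it."""
    if position_bias is not None and position_bias.size(-1) != key_length:
        return None
    return position_bias
```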