Support flash attention 2 with KV's sequence length longer than Q's #2033
Implemented this case both with and without a causal mask. With a causal mask (queries as rows, keys as columns), the mask looks like:
111000
111100
111110
111111
Only the upper-right triangular part is masked.
I added `P_SEQ` to denote the extra sequence length of KV relative to Q.
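For clarity, here is a minimal PyTorch sketch (not the kernel code) of the mask described above, assuming the key/value length is the query length plus `P_SEQ`; the function name and argument names are illustrative only:

```python
import torch

def causal_mask_with_p_seq(seqlen_q: int, p_seq: int) -> torch.Tensor:
    """Return a (seqlen_q, seqlen_k) boolean mask where True means 'may attend'.

    With seqlen_k = seqlen_q + p_seq, query i may attend to keys 0..i + p_seq,
    so only the upper-right triangle is masked out.
    """
    seqlen_k = seqlen_q + p_seq
    q_idx = torch.arange(seqlen_q).unsqueeze(1)   # (seqlen_q, 1)
    k_idx = torch.arange(seqlen_k).unsqueeze(0)   # (1, seqlen_k)
    return k_idx <= q_idx + p_seq

# Example: seqlen_q = 4, P_SEQ = 2 reproduces the 4x6 pattern shown above.
print(causal_mask_with_p_seq(4, 2).int())
```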