Skip to content

[Performance] Avoid cuda sync in postprocess of LLM decoding #15079

[Performance] Avoid cuda sync in postprocess of LLM decoding

[Performance] Avoid cuda sync in postprocess of LLM decoding #15079

Annotations

1 warning

Test

succeeded Oct 28, 2024 in 31m 34s