Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[feat] add disable_sliding_window parameter to vllm/lmi-dist engine args #2338

Merged

Conversation

hommayushi3
Copy link
Contributor

Description

Adds VLLM's disable_sliding_window parameter to vllm/lmi-dist engine args. This allows for using models like Phi-3-vision that specify a sliding window in their model config.json files to be used with Flash Attention 2 because vllm does not currently support flash attention with sliding window.

Addresses #2337.

@hommayushi3 hommayushi3 requested review from zachgk, frankfliu and a team as code owners August 23, 2024 13:23
@tosterberg tosterberg merged commit 32cafd2 into deepjavalibrary:master Aug 25, 2024
8 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants