Issues: foundation-model-stack/fms-acceleration
#84 Slowdown and Higher Memory Consumption for GPTQ-LoRA with Bfloat16 (opened Sep 12, 2024 by achew010)
#83 Distributed Training Problems for QLoRA Models with Transformers Pre-release 4.45 (opened Sep 11, 2024 by achew010)
#77 Ensure Model is Correctly Loaded Depending on Plugin for Augmentation Purposes (opened Aug 29, 2024 by fabianlim)
#70 Inconsistency in Padding-Free Benchmarks with Different Transformers Versions (opened Aug 19, 2024 by achew010)
#52 Introduce a Better Dequantization Fix on Triton Function for FOAK Plugin's GPTQ Fused Operations (opened Jul 15, 2024 by achew010)
#50 BNB Benchmark Experiments Run Out of Memory with Non-Zero LoRA Dropout (opened Jul 12, 2024 by achew010)
#17 Release Upper Limit on Torch, Transformers, Accelerate (label: dependency; opened May 23, 2024 by fabianlim)