Pull requests: nod-ai/sharktank

Pull requests list

[shortfin] Add argmax host op.
#206 opened Sep 21, 2024 by stellaraccident
[punet] Add support to export fp8 punet variant
#202 opened Sep 20, 2024 by nithinsubbiah
Einsum kernel and test, WIP
#200 opened Sep 19, 2024 by KyleHerndon
Prefill tests for llama 2 7b and llama 3.1 8b
#199 opened Sep 19, 2024 by aviator19941
[sharktank] Export Attention IRs for LLMs
#175 opened Sep 9, 2024 by archana-ramalingam
initial grok
#169 opened Sep 5, 2024 by dan-garvey
Quantizing manually
#118 opened Jul 25, 2024 by rohan-tan-bhowmik Draft
Quark dataset importer for fp8
#96 opened Jul 9, 2024 by dan-garvey
Enable Mixtral LLM model
#36 opened May 21, 2024 by archana-ramalingam
[llama] Enable flash attention path
#7 opened Apr 24, 2024 by rsuderman