Pull requests: nod-ai/shark-ai
Pull requests list
Bump IREE requirement pins to their latest versions.
#867
opened Jan 24, 2025 by
shark-pr-automator
bot
[sharktank][llama] Adds parity to hf's rotary_embedding layer and a test to maintain it
#863
opened Jan 24, 2025 by
dan-garvey
[Sharktank][Llama][FP8] Minimal changes for numerically correct fp8
#859
opened Jan 22, 2025 by
dan-garvey
Draft
[Llama] Do not allow configurable partitions for KVCache
#856
opened Jan 21, 2025 by
Groverkss
Bump iree dependencies forward to include barrier changes
#834
opened Jan 16, 2025 by
rsuderman
Fixed mixed precision contraction semantics for mmt_block_scaled_offset_q4_unsigned
#720
opened Dec 20, 2024 by
Groverkss
Enable tokenizers in shortfin packages on Linux x86_64.
#688
opened Dec 12, 2024 by
ScottTodd
Expanded sharded support for alternative sharding mechanisms
#680
opened Dec 12, 2024 by
rsuderman
[shortfin] Implement async alloc/dealloc of buffers.
#507
opened Nov 14, 2024 by
stellaraccident