-
Notifications
You must be signed in to change notification settings - Fork 266
Pull requests: huggingface/optimum-habana
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
Add batch splitting in attention layer for decode to hide NIC latency
#2334
opened Oct 31, 2025 by
jthakurH
Loading…
Temporarily remove broken examples for OH 1.20.0 release
#2329
opened Oct 29, 2025 by
gplutop7
Loading…
Backport transpose_for_scores function in modeling xlm roberta.
#2300
opened Oct 8, 2025 by
AKloniecki
Loading…
[llama] Store KV Cache on CPU and Use PyTorch
SPDA for Next token generation
#1182
opened Aug 2, 2024 by
zhentaoyu
Loading…
ProTip!
Filter pull requests by the default branch with base:main.