-
Notifications
You must be signed in to change notification settings - Fork 202
Pull requests: flashinfer-ai/flashinfer
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
fix: Pass backend in BatchPrefillWith*KVCacheWrapper.plan()
#808
opened Feb 12, 2025 by
sfc-gh-yewang
Loading…
feat: support MLA decode, implemented by CuTe targeted to SM80
#766
opened Jan 31, 2025 by
tsu-bin
Loading…
Improve error message when TORCH_CUDA_ARCH_LIST has many supported architechtures.
#686
opened Dec 19, 2024 by
pavanimajety
Loading…
ProTip!
Add no:assignee to see everything that’s not assigned.