-
Notifications
You must be signed in to change notification settings - Fork 1k
Pull requests: huggingface/text-generation-inference
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
hotfix: ipex fails since cuda moe kernel is not supported
#2532
opened Sep 18, 2024 by
sywangyi
Loading…
CI for add gptq and awq int4 support in intel platform
#2494
opened Sep 5, 2024 by
ErikKaum
Loading…
fix: skip cuda graphs that will oom and improve free memory logging
#2450
opened Aug 22, 2024 by
drbh
Loading…
add gptq and awq int4 support in intel platform
#2444
opened Aug 22, 2024 by
sywangyi
Loading…
5 tasks
[TENSORRT-LLM] - Implement new looper thread based backend
#2357
opened Aug 2, 2024 by
mfuntowicz
•
Draft
ProTip!
Adding no:label will show everything without a label.