Pull requests: huggingface/text-generation-inference

Pull requests list

We can have a tokenizer anywhere.
#2527 opened Sep 17, 2024 by Narsil
5 tasks
speculative decoding complete guide added
#2524 opened Sep 17, 2024 by shirinyamani
Nix integration tests
#2518 opened Sep 12, 2024 by Narsil
5 tasks
feat: support phi3.5 moe
#2479 opened Aug 30, 2024 by drbh
9 tasks done
Small fixes for supported models
#2471 opened Aug 29, 2024 by osanseviero
add gptq and awq int4 support in intel platform
#2444 opened Aug 22, 2024 by sywangyi
5 tasks
Fix Numba Cache Error
#2443 opened Aug 21, 2024 by tylertitsworth
1 of 5 tasks
Improve vlm support (add idefics3 support)
#2437 opened Aug 20, 2024 by drbh Draft
4 tasks
fix: repack for marlin when single scale is provided
#2414 opened Aug 13, 2024 by drbh
add docker load_tests
#2387 opened Aug 9, 2024 by ngxson Draft
feat: add release and sha tagged images
#2360 opened Aug 5, 2024 by drbh
Update ROCM libs and improvements
#2358 opened Aug 5, 2024 by mht-sharma
7 tasks done
Update vLLM dependency to 0.5.3.post1 [Stale]
#2317 opened Jul 26, 2024 by danieldk Draft
5 tasks
Add model_load_time metric
#2311 opened Jul 26, 2024 by Edwinhr716
2 of 5 tasks
adding max_token_capacity metric
#2279 opened Jul 22, 2024 by Edwinhr716
2 of 5 tasks
feat: Add load tests
#2217 opened Jul 11, 2024 by Hugoch
1 of 5 tasks
Add FP8 KVCache support
#2028 opened Jun 6, 2024 by mht-sharma
1 of 4 tasks