Issues: vllm-project/llm-compressor

FEATURE REQUESTS
#68 opened Aug 8, 2024 by robertgshaw2-neuralmagic
Open 1
MODEL REQUESTS
#69 opened Aug 8, 2024 by robertgshaw2-neuralmagic
Open 51
Q3 ROADMAP
#30 opened Jul 22, 2024 by robertgshaw2-neuralmagic
Open 4

Issues list

Qwen1.5-MoE-A2.7B-Chat w4a16 Quantization Failed (label: bug)
#189 opened Sep 20, 2024 by donpromax
DeepseekV2-w8a8 weight needed in HF (label: enhancement)
#175 opened Sep 13, 2024 by Eviannn
[USAGE] FP8 W8A8 (+KV) with LORA Adapters (label: enhancement)
#164 opened Sep 11, 2024 by paulliwog
Error in the file 2:4_w4a16_group-128_recipe.yaml (label: bug)
#154 opened Sep 10, 2024 by carrot-o0o
KeyError with torch.float8_e4m3fn (label: bug)
#138 opened Sep 2, 2024 by Lue-C
[Performance]: SmoothQuant quantization is too slow
#112 opened Aug 26, 2024 by zxy1119
1 task done
convert model to FP8 error (label: bug)
#110 opened Aug 26, 2024 by kuangdao
SmoothQuant doesn't work with cpu offloading (label: bug)
#107 opened Aug 23, 2024 by anmarques
[Bug]: Index Error tuple out of range (label: bug)
#106 opened Aug 23, 2024 by SeanIsYoung
Layers not skipped with ignore=[ "re:.*"] (label: bug)
#91 opened Aug 15, 2024 by horheynm
lm_eval compatibility with generated model (label: bug)
#83 opened Aug 13, 2024 by horheynm
Llava model quantization seems not be supported (label: bug)
#73 opened Aug 10, 2024 by caojinpei
MODEL REQUESTS (label: enhancement)
#69 opened Aug 8, 2024 by robertgshaw2-neuralmagic
FEATURE REQUESTS (label: enhancement)
#68 opened Aug 8, 2024 by robertgshaw2-neuralmagic
Q3 ROADMAP (label: roadmap)
#30 opened Jul 22, 2024 by robertgshaw2-neuralmagic
8 of 21 tasks