-
Notifications
You must be signed in to change notification settings - Fork 10.8k
Issues: ggml-org/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
Eval bug: CANNOT LINK EXECUTABLE "./llama-cli": library "libomp.so" not found: needed by main executable
bug-unconfirmed
#11979
opened Feb 20, 2025 by
Krallbe68
GGML to GGUF Quantized tensor bytes per row (5120) is not a multiple of Q2_K type size (84)
#11976
opened Feb 20, 2025 by
chokoon123
tensor 'blk.25.ffn_down.weight' has invalid ggml type 42 (NONE)
bug-unconfirmed
#11975
opened Feb 20, 2025 by
evaninf
Misc. bug: Sporadic MUL_MAT Failures in test-backend-ops for Nvidia backend
bug-unconfirmed
#11972
opened Feb 20, 2025 by
ShanoToni
Misc. bug: The KV cache is sometimes truncated incorrectly when making v1/chat/completions API calls
bug-unconfirmed
#11970
opened Feb 20, 2025 by
vnicolici
Maybe it would better to have a diagram to show how llama.cpp process inferences
#11967
opened Feb 20, 2025 by
yinuu
Eval bug: Ram boom after using llama-bench with cuda12.8 and deepseekr1q6
bug-unconfirmed
#11965
opened Feb 20, 2025 by
Xxianna
Misc. bug: Rpc-server does not use opencl backend on Android.
bug-unconfirmed
#11957
opened Feb 19, 2025 by
belog2867
Misc. bug: Segmentation fault when importing model to opencl buffer
bug-unconfirmed
#11953
opened Feb 19, 2025 by
zhouzengming
Eval bug: llama.cpp Incorrectly Parses and Reports sprintf Calls in C++ Code
bug-unconfirmed
#11951
opened Feb 19, 2025 by
perdubug
Misc. bug: hipGraph causes a crash in hipGraphDestroy
AMD GPU
Issues specific to AMD GPUs
#11949
opened Feb 18, 2025 by
IMbackK
Eval bug: Segmentation fault with Docker ROCm image "full-rocm"
bug-unconfirmed
#11947
opened Feb 18, 2025 by
JFingerle
Add option to build CUDA backend without Flash attention
enhancement
New feature or request
#11946
opened Feb 18, 2025 by
slaren
Feature Request: 推理minicpmv时,encoding_image_with_clip耗时很久
enhancement
New feature or request
#11941
opened Feb 18, 2025 by
EnzhiZhou
4 tasks done
Enhancement: Improve ROCm performance on various quants (benchmarks included)
enhancement
New feature or request
#11931
opened Feb 17, 2025 by
cb88
4 tasks done
Misc. bug: RPC attempt fails with a specific error, but I cannot find any info on troubleshooting it
bug-unconfirmed
#11929
opened Feb 17, 2025 by
maglore9900
Eval bug: --api-key is invalid when i set a string contains %
bug-unconfirmed
#11928
opened Feb 17, 2025 by
zhangtao103239
Previous Next
ProTip!
What’s not been updated in a month: updated:<2025-01-20.