Skip to content

Issues: ggerganov/llama.cpp

changelog : libllama API
#9289 opened Sep 3, 2024 by ggerganov
Open 1
changelog : llama-server REST API
#9291 opened Sep 3, 2024 by ggerganov
Open 7
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Assignee
Filter by who’s assigned
Sort

Issues list

fatal error: 'hip/hip_fp16.h' file not found when building using CMake and ROCm 6.2 bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10236 opened Nov 9, 2024 by lubosz
Bug: server GET /props request return json with chat_template with last char replaced by \x00 bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10235 opened Nov 9, 2024 by kks-imt
Bug: CUBLAS_STATUS_INTERNAL_ERROR when using --gpu-layers on ROCm 6.2 bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10234 opened Nov 9, 2024 by lubosz
Bug: Server Slows Down Significantly Over Time, Requires Frequent Reboots (RX 7900 XT) bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10227 opened Nov 9, 2024 by tigert2173
Bug: image encoding error with malloc memory bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10225 opened Nov 9, 2024 by dingtine
bge-multilingual-gemma2:ERROR:hf-to-gguf:Model Gemma2Model is not supported bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10215 opened Nov 8, 2024 by hellozjj
Bug: not support langchain v0.3 to use tools bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10214 opened Nov 8, 2024 by lee249876293
Feature Request: Support Airllm enhancement New feature or request
#10202 opened Nov 7, 2024 by kbocock-krg
4 tasks done
Bug: DLLAMA_VULKAN=1 tag is not linking vulkan bug-unconfirmed high severity Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10201 opened Nov 7, 2024 by andrewson97
Bug: Nondeterministic results on AMD RDNA3 (ROCm) despite zero temperature and fixed seed bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10197 opened Nov 6, 2024 by Googulator
Bug: SYCL crash bug-unconfirmed critical severity Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#10184 opened Nov 5, 2024 by 0xDEADFED5
Feature Request: Support BitNet.cpp quantization format enhancement New feature or request
#10179 opened Nov 5, 2024 by luionTW
Bug: Failed to convert OuteAI/OuteTTS-0.1-350M bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10178 opened Nov 5, 2024 by apepkuss
Bug: Speculative Decoding "Segmentation fault (core dumped)" bug Something isn't working low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10176 opened Nov 4, 2024 by AbdullahMPrograms
tts : add basic example for text-to-speech good first issue Good for newcomers tts Text-to-speech
#10173 opened Nov 4, 2024 by ggerganov
Bug: CANN E89999 Ascend NPU issues specific to Ascend NPUs bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10161 opened Nov 4, 2024 by ninth99
Feature Request: [CANN] backend supports Ascend 310P Ascend NPU issues specific to Ascend NPUs enhancement New feature or request
#10160 opened Nov 4, 2024 by leo-pony
4 tasks done
Bug: GGML_ASSERT(i01 >= 0 && i01 < ne01) failed bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10157 opened Nov 4, 2024 by ccreutzi
Bug: --log-disable also disables output from the model bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10155 opened Nov 4, 2024 by sendcat
Bug: gguf tries to access newbyteorder, which was removed in numpy2.0 bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10127 opened Nov 1, 2024 by renxida
Bug: llama-quantize --help is not printed bug-unconfirmed low severity Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10122 opened Nov 1, 2024 by ivanstepanovftw
Feature Request: count tokens before calling '/v1/chat/completions' enhancement New feature or request
#10115 opened Nov 1, 2024 by GPTLocalhost
4 tasks done
Bug: [SYCL] SYCL + Docker bug-unconfirmed medium severity Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10113 opened Oct 31, 2024 by easyfab
ProTip! Type g i on any issue or pull request to go back to the issue listing page.