-
Notifications
You must be signed in to change notification settings - Fork 9.7k
Issues: ggerganov/llama.cpp
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
fatal error: 'hip/hip_fp16.h' file not found when building using CMake and ROCm 6.2
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10236
opened Nov 9, 2024 by
lubosz
Bug: server GET /props request return json with chat_template with last char replaced by \x00
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10235
opened Nov 9, 2024 by
kks-imt
Bug: CUBLAS_STATUS_INTERNAL_ERROR when using --gpu-layers on ROCm 6.2
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10234
opened Nov 9, 2024 by
lubosz
Bug: Server Slows Down Significantly Over Time, Requires Frequent Reboots (RX 7900 XT)
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10227
opened Nov 9, 2024 by
tigert2173
Bug: image encoding error with malloc memory
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10225
opened Nov 9, 2024 by
dingtine
bge-multilingual-gemma2:ERROR:hf-to-gguf:Model Gemma2Model is not supported
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10215
opened Nov 8, 2024 by
hellozjj
Bug: not support langchain v0.3 to use tools
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10214
opened Nov 8, 2024 by
lee249876293
Feature Request: Support Airllm
enhancement
New feature or request
#10202
opened Nov 7, 2024 by
kbocock-krg
4 tasks done
Bug: DLLAMA_VULKAN=1 tag is not linking vulkan
bug-unconfirmed
high severity
Used to report high severity bugs in llama.cpp (Malfunctioning hinder important workflow)
#10201
opened Nov 7, 2024 by
andrewson97
Bug: Nondeterministic results on AMD RDNA3 (ROCm) despite zero temperature and fixed seed
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10197
opened Nov 6, 2024 by
Googulator
Bug: SYCL crash
bug-unconfirmed
critical severity
Used to report critical severity bugs in llama.cpp (e.g. Crashing, Corrupted, Dataloss)
#10184
opened Nov 5, 2024 by
0xDEADFED5
ggml : move LLAMAFILE/tinyBLAS into a backend
refactoring
Refactoring
#10183
opened Nov 5, 2024 by
ggerganov
ggml : refactor ggml-cpu.c into multiple C++ source files
refactoring
Refactoring
#10180
opened Nov 5, 2024 by
ggerganov
Feature Request: Support BitNet.cpp quantization format
enhancement
New feature or request
#10179
opened Nov 5, 2024 by
luionTW
Bug: Failed to convert Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
OuteAI/OuteTTS-0.1-350M
bug-unconfirmed
medium severity
#10178
opened Nov 5, 2024 by
apepkuss
Bug: Speculative Decoding "Segmentation fault (core dumped)"
bug
Something isn't working
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10176
opened Nov 4, 2024 by
AbdullahMPrograms
tts : add basic example for text-to-speech
good first issue
Good for newcomers
tts
Text-to-speech
#10173
opened Nov 4, 2024 by
ggerganov
Bug: CANN E89999
Ascend NPU
issues specific to Ascend NPUs
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10161
opened Nov 4, 2024 by
ninth99
Feature Request: [CANN] backend supports Ascend 310P
Ascend NPU
issues specific to Ascend NPUs
enhancement
New feature or request
#10160
opened Nov 4, 2024 by
leo-pony
4 tasks done
Bug: GGML_ASSERT(i01 >= 0 && i01 < ne01) failed
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10157
opened Nov 4, 2024 by
ccreutzi
Bug: --log-disable also disables output from the model
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10155
opened Nov 4, 2024 by
sendcat
Bug: gguf tries to access newbyteorder, which was removed in numpy2.0
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10127
opened Nov 1, 2024 by
renxida
Bug: llama-quantize --help is not printed
bug-unconfirmed
low severity
Used to report low severity bugs in llama.cpp (e.g. cosmetic issues, non critical UI glitches)
#10122
opened Nov 1, 2024 by
ivanstepanovftw
Feature Request: count tokens before calling '/v1/chat/completions'
enhancement
New feature or request
#10115
opened Nov 1, 2024 by
GPTLocalhost
4 tasks done
Bug: [SYCL] SYCL + Docker
bug-unconfirmed
medium severity
Used to report medium severity bugs in llama.cpp (e.g. Malfunctioning Features but still useable)
#10113
opened Oct 31, 2024 by
easyfab
Previous Next
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.