Skip to content

Pull requests: ggerganov/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Sort

Pull requests list

nix: update flake.lock nix Issues specific to consuming flake.nix, or generally concerned with ❄ Nix-based llama.cpp deployment
#10243 opened Nov 10, 2024 by ggerganov Loading…
server: Add back samplers examples server
#10239 opened Nov 9, 2024 by MaggotHATE Loading…
2 of 4 tasks
metal : refactor kernel args into structs ggml changes relating to the ggml tensor library for machine learning
#10238 opened Nov 9, 2024 by ggerganov Draft
2 of 4 tasks
llama : use ggml_backend_dev_get_extra_bufts
#10228 opened Nov 9, 2024 by danbev Loading…
2 tasks done
vulkan: Fix newly added tests for permuted mul_mat and 1D im2col ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#10226 opened Nov 9, 2024 by jeffbolznv Loading…
2 of 4 tasks
support for llguidance grammars
#10224 opened Nov 9, 2024 by mmoskal Draft
vulkan: Throttle the number of shader compiles during the build step
#10222 opened Nov 8, 2024 by jeffbolznv Loading…
2 of 4 tasks
ci: add Ascend CANN build devops improvements to build systems and github actions
#10217 opened Nov 8, 2024 by xuedinge233 Draft
CANN Support Ascend310P to accelerate F32 and F16 LLM Model Ascend NPU issues specific to Ascend NPUs enhancement New feature or request
#10216 opened Nov 8, 2024 by leo-pony Loading…
2 of 4 tasks
AVX BF16 and single scale quant optimizations ggml changes relating to the ggml tensor library for machine learning
#10212 opened Nov 8, 2024 by netrunnereve Loading…
2 of 4 tasks
docs: add doxygen documentation build Compilation issues
#10209 opened Nov 8, 2024 by sparkleholic Loading…
2 of 4 tasks
Draft: vulkan: Add VK_NV_cooperative_matrix2 support for mul_mat and FlashAttention2 ggml changes relating to the ggml tensor library for machine learning testing Everything test related Vulkan Issues specific to the Vulkan backend
#10206 opened Nov 7, 2024 by jeffbolznv Loading…
2 of 4 tasks
Introduce IQ4_NL_4_4 format and its neon implementation examples ggml changes relating to the ggml tensor library for machine learning
#10196 opened Nov 6, 2024 by FanShupei Loading…
2 of 4 tasks
CUDA: always create events for split buffers Review Complexity : Medium Generally require more time to grok but manageable by beginner to medium expertise level
#10185 opened Nov 5, 2024 by JohannesGaessler Loading…
Introduce New Lookup-Table(LUT)-Based Matrix Multiplication Method ggml changes relating to the ggml tensor library for machine learning python python script changes
#10181 opened Nov 5, 2024 by QingtaoLi1 Loading…
2 of 4 tasks
gguf-py: Improve GGUFReader read-only mode performance python python script changes
#10159 opened Nov 4, 2024 by Isotr0py Loading…
2 of 4 tasks
main : add new feature: special commands demo Demonstrate some concept or idea, not intended to be merged examples
#10145 opened Nov 3, 2024 by ngxson Draft
2 tasks done
Fix docker locale issue (#6267) devops improvements to build systems and github actions
#10142 opened Nov 2, 2024 by fyun89 Loading…
2 of 4 tasks
ggml : skip register metal backend on os simulator ggml changes relating to the ggml tensor library for machine learning
#10132 opened Nov 2, 2024 by jhen0409 Loading…
2 of 4 tasks
Add user-provided tokenizer/detokenizer functionality examples
#10131 opened Nov 2, 2024 by ilan-theodoro Loading…
2 of 4 tasks
Server: handle generation until context is filled examples python python script changes server
#10128 opened Nov 1, 2024 by VJHack Loading…
2 of 4 tasks
ProTip! Find all pull requests that aren't related to any open issues with -linked:issue.