Pull requests: ggml-org/llama.cpp

Pull requests list

compare llama-bench: add option to plot
Labels: python, script
#14169 opened Jun 13, 2025 by am17an
Add NeoBERT
Labels: python
#14164 opened Jun 13, 2025 by huydt84
ggml : implement REGLU/GEGLU/SWIGLU ops
Labels: Apple Metal, ggml, help wanted, Nvidia GPU, testing
#14158 opened Jun 12, 2025 by CISC
ggml-cpu : rework weak alias on apple targets
Labels: ggml
#14146 opened Jun 12, 2025 by xctan Draft
tests : add test-model-random
Labels: help wanted, testing
#14139 opened Jun 12, 2025 by compilade Draft
1 of 14 tasks
vulkan: mutex around vkQueueSubmit
Labels: ggml, Vulkan
#14127 opened Jun 11, 2025 by jeffbolznv
llama-model : add dots.llm1 architecture support (#14044)
Labels: python
#14118 opened Jun 11, 2025 by Noeda
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization
Labels: ggml
#14117 opened Jun 11, 2025 by Vithulep
scripts: Fix remote option in Windows (#14102)
Labels: python
#14100 opened Jun 10, 2025 by pqnet
Bump ROCm versions, re-enable in GHA
Labels: devops
#14098 opened Jun 10, 2025 by gremlinofthemysticarts
server: add model alias presets
Labels: examples, python, server
#14083 opened Jun 9, 2025 by am17an
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM
Labels: ggml
#14067 opened Jun 8, 2025 by JohannesGaessler Draft
vulkan : fix build failure caused by vulkan-shaders-gen install
Labels: ggml, Vulkan
#14047 opened Jun 6, 2025 by AsbjornOlling
ggml-cpu: optimise assembly calls for hsum on s390x
#14037 opened Jun 5, 2025 by taronaeo
llama : add thread safety test
Labels: devops, ggml, testing
#14035 opened Jun 5, 2025 by slaren
cpu: Update RISC-V condition to require GCC version 14 or higher
Labels: ggml
#14032 opened Jun 5, 2025 by Ghosts381937
llama : support qwen3 rerank and embeddings
Labels: examples, python, server
#14029 opened Jun 5, 2025 by ngxson
ggml-cpu: fix uncaught underscore terminators for s390x
Labels: ggml
#14023 opened Jun 5, 2025 by taronaeo
llama: Attempt to add ModernBert
Labels: python
#14014 opened Jun 4, 2025 by huydt84
llama-chat : Do not throw when tool parsing fails
#14012 opened Jun 4, 2025 by p1-0tr
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation
Labels: Ascend NPU, ggml
#14002 opened Jun 4, 2025 by luyhcsu