Skip to content

Pull requests: ggml-org/llama.cpp

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

vulkan: mutex around vkQueueSubmit ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14127 opened Jun 11, 2025 by jeffbolznv Loading…
cmake : handle whitepsaces in path during metal build Apple Metal https://en.wikipedia.org/wiki/Metal_(API) ggml changes relating to the ggml tensor library for machine learning
#14126 opened Jun 11, 2025 by ggerganov Loading…
sycl: Remove not needed copy f16->f32 for dnnl mul mat ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14125 opened Jun 11, 2025 by ShanoToni Loading…
Update multimodal.md documentation Improvements or additions to documentation
#14122 opened Jun 11, 2025 by ddpasa Loading…
llama-model : add dots.llm1 architecture support (#14044) python python script changes
#14118 opened Jun 11, 2025 by Noeda Loading…
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization ggml changes relating to the ggml tensor library for machine learning
#14117 opened Jun 11, 2025 by Vithulep Loading…
scripts: Fix remote option in Windows (#14102) python python script changes
#14100 opened Jun 10, 2025 by pqnet Loading…
Bump ROCm versions, re-enable in GHA devops improvements to build systems and github actions
#14098 opened Jun 10, 2025 by gremlinofthemysticarts Loading…
server: add model alias presets examples python python script changes server
#14083 opened Jun 9, 2025 by am17an Loading…
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM ggml changes relating to the ggml tensor library for machine learning
#14067 opened Jun 8, 2025 by JohannesGaessler Draft
vulkan : fix build failure caused by vulkan-shaders-gen install ggml changes relating to the ggml tensor library for machine learning Vulkan Issues specific to the Vulkan backend
#14047 opened Jun 6, 2025 by AsbjornOlling Loading…
ggml-cpu: optimise assembly calls for hsum on s390x
#14037 opened Jun 5, 2025 by taronaeo Loading…
llama : add thread safety test devops improvements to build systems and github actions ggml changes relating to the ggml tensor library for machine learning testing Everything test related
#14035 opened Jun 5, 2025 by slaren Loading…
sycl: Adding additional cpy dbg print output ggml changes relating to the ggml tensor library for machine learning SYCL https://en.wikipedia.org/wiki/SYCL - GPU programming language
#14034 opened Jun 5, 2025 by ShanoToni Loading…
cpu: Update RISC-V condition to require GCC version 14 or higher ggml changes relating to the ggml tensor library for machine learning
#14032 opened Jun 5, 2025 by Ghosts381937 Loading…
llama : support qwen3 rerank and embeddings examples python python script changes server
#14029 opened Jun 5, 2025 by ngxson Loading…
ggml-cpu: fix uncaught underscore terminators for s390x ggml changes relating to the ggml tensor library for machine learning
#14023 opened Jun 5, 2025 by taronaeo Loading…
llama: Attempt to add ModernBert python python script changes
#14014 opened Jun 4, 2025 by huydt84 Loading…
llama-chat : Do not throw when tool parsing fails
#14012 opened Jun 4, 2025 by p1-0tr Loading…
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation Ascend NPU issues specific to Ascend NPUs ggml changes relating to the ggml tensor library for machine learning
#14002 opened Jun 4, 2025 by luyhcsu Loading…
ProTip! Exclude everything labeled bug with -label:bug.