Pull requests: ggml-org/llama.cpp
Pull requests list
compare llama-bench: add option to plot
python
python script changes
script
Script related
#14169
opened Jun 13, 2025 by
am17an
ggml : implement REGLU/GEGLU/SWIGLU ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
help wanted
Extra attention is needed
Nvidia GPU
Issues specific to Nvidia GPUs
testing
Everything test related
#14158
opened Jun 12, 2025 by
CISC
models/templates: add mistralai/Mistral-Small-3.1-24B-Instruct-2503 template with tool calling support
#14148
opened Jun 12, 2025 by
bretello
vulkan: mutex around vkQueueSubmit
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14127
opened Jun 11, 2025 by
jeffbolznv
llama-model : add dots.llm1 architecture support (#14044)
python
python script changes
#14118
opened Jun 11, 2025 by
Noeda
ggml: aarch64: Implement SVE Kernels for Int 8 Quantization
ggml
changes relating to the ggml tensor library for machine learning
#14117
opened Jun 11, 2025 by
Vithulep
scripts: Fix remote option in Windows (#14102)
python
python script changes
#14100
opened Jun 10, 2025 by
pqnet
Bump ROCm versions, re-enable in GHA
devops
improvements to build systems and github actions
#14098
opened Jun 10, 2025 by
gremlinofthemysticarts
llama: automatically set runtime parameters such as --n-gpu-layers to fit VRAM
ggml
changes relating to the ggml tensor library for machine learning
#14067
opened Jun 8, 2025 by
JohannesGaessler
Draft
vulkan : fix build failure caused by vulkan-shaders-gen install
ggml
changes relating to the ggml tensor library for machine learning
Vulkan
Issues specific to the Vulkan backend
#14047
opened Jun 6, 2025 by
AsbjornOlling
cpu: Update RISC-V condition to require GCC version 14 or higher
ggml
changes relating to the ggml tensor library for machine learning
#14032
opened Jun 5, 2025 by
Ghosts381937
ggml-cpu: fix uncaught underscore terminators for s390x
ggml
changes relating to the ggml tensor library for machine learning
#14023
opened Jun 5, 2025 by
taronaeo
server: Enable mtmd in llama-server /completion endpoint
examples
server
#14016
opened Jun 4, 2025 by
92MING
llama: Attempt to add ModernBert
python
python script changes
#14014
opened Jun 4, 2025 by
huydt84
[CANN]:Replace aclrtMemsetSync with InplaceZero operator for zero tensor creation
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14002
opened Jun 4, 2025 by
luyhcsu