-
Notifications
You must be signed in to change notification settings - Fork 11.4k
Pull requests: ggml-org/llama.cpp
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling
examples
ggml
changes relating to the ggml tensor library for machine learning
#12995
opened Apr 17, 2025 by
max-krasnyansky
•
Draft
[CANN] Add the n_graph_splits performance metric to llama-bench.
Ascend NPU
issues specific to Ascend NPUs
examples
#12994
opened Apr 17, 2025 by
bachelor-dou
SYCL: Add non-contiguous support in ROPE
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12993
opened Apr 17, 2025 by
qnixsynapse
Loading…
Fix convert script for non-hf GLM4 checkpoints
python
python script changes
#12992
opened Apr 17, 2025 by
Tianyue-Zhao
Loading…
2 of 4 tasks
SYCL: Refactor and enable FP16 in binary broadcast OPs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12975
opened Apr 16, 2025 by
qnixsynapse
Loading…
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12972
opened Apr 16, 2025 by
lslusarczyk
•
Draft
Resolved half rope,multi-EOS issues in convert_hf_togguf.py for GLM4Z Model
python
python script changes
#12957
opened Apr 15, 2025 by
piDack
Loading…
rpc : add RPC_CMD_HELLO
examples
ggml
changes relating to the ggml tensor library for machine learning
#12955
opened Apr 15, 2025 by
rgerganov
Loading…
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR
ggml
changes relating to the ggml tensor library for machine learning
set b = ub when b > ub with embedding
examples
server
#12940
opened Apr 14, 2025 by
ahmedshakill
Loading…
server : use std::move whenever possible
examples
server
#12936
opened Apr 14, 2025 by
ngxson
Loading…
gguf-py : GGUF Editor GUI - Python + Qt
python
python script changes
#12930
opened Apr 13, 2025 by
christopherthompson81
Loading…
mtmd : add methods to access
mtmd_image_tokens
examples
#12906
opened Apr 11, 2025 by
ngxson
Loading…
Get CPU model in ggml_backend_cpu_device_context on FreeBSD
ggml
changes relating to the ggml tensor library for machine learning
#12902
opened Apr 11, 2025 by
yurivict
Loading…
cuda: fix compilation error (#12893)
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#12894
opened Apr 11, 2025 by
lizhenneng
Loading…
llama-bench: enhance benchmark with improved token throughput measurements
examples
#12874
opened Apr 10, 2025 by
thevishalagarwal
Loading…
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX
ggml
changes relating to the ggml tensor library for machine learning
#12871
opened Apr 10, 2025 by
slaren
Loading…
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs
ggml
changes relating to the ggml tensor library for machine learning
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
#12858
opened Apr 10, 2025 by
Alcpz
Loading…
2 of 3 tasks
gguf-py: byteswapping improvements
python
python script changes
#12851
opened Apr 9, 2025 by
AlekseiNikiforovIBM
Loading…
metal : add memory pool for temp allocs
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
#12850
opened Apr 9, 2025 by
ggerganov
Loading…
9 tasks done
Previous Next
ProTip!
Type g p on any issue or pull request to go back to the pull request listing page.