
Pull requests: ggml-org/llama.cpp


Pull requests list

threading: support for GGML_SCHED_PRIO_LOW, update thread info on Windows to avoid throttling [labels: examples, ggml]
#12995 opened Apr 17, 2025 by max-krasnyansky Draft
SYCL: Add non-contiguous support in ROPE [labels: ggml, SYCL]
#12993 opened Apr 17, 2025 by qnixsynapse
Fix convert script for non-hf GLM4 checkpoints [labels: python]
#12992 opened Apr 17, 2025 by Tianyue-Zhao
2 of 4 tasks
SYCL: Refactor and enable FP16 in binary broadcast OPs [labels: ggml, SYCL]
#12975 opened Apr 16, 2025 by qnixsynapse
sycl: use DNN in the first part of ggml_sycl_mul_mat_batched_sycl [labels: ggml, SYCL]
#12972 opened Apr 16, 2025 by lslusarczyk Draft
Vulkan: Fix Deepseek V2 inference by making ggml_vk_op_supports_incontiguous(GGML_OP_RMS_NORM) return true [labels: ggml, Vulkan]
#12960 opened Apr 15, 2025 by stduhpf Draft
Resolved half rope,multi-EOS issues in convert_hf_togguf.py for GLM4Z Model [labels: python]
#12957 opened Apr 15, 2025 by piDack
rpc : add RPC_CMD_HELLO [labels: examples, ggml]
#12955 opened Apr 15, 2025 by rgerganov
main : Fix Ctrl+D/newline handling [labels: examples]
#12951 opened Apr 15, 2025 by danielzgtg
rpc : do not wait for response when sending RPC_CMD_SET_TENSOR [labels: ggml]
#12943 opened Apr 14, 2025 by rgerganov Draft
gguf-py : GGUF Editor GUI - Python + Qt [labels: python]
#12930 opened Apr 13, 2025 by christopherthompson81
llama-bench : Add --override-tensors arg [labels: examples]
#12922 opened Apr 12, 2025 by 4onen
Get CPU model in ggml_backend_cpu_device_context on FreeBSD [labels: ggml]
#12902 opened Apr 11, 2025 by yurivict
cuda: fix compilation error (#12893) [labels: ggml, Nvidia GPU]
#12894 opened Apr 11, 2025 by lizhenneng
llama-tts : input from stdin [labels: examples]
#12890 opened Apr 11, 2025 by marcoStocchi
ggml : add SSE 4.2 and x64 base variant for CPUs without AVX [labels: ggml]
#12871 opened Apr 10, 2025 by slaren
sycl : implementation of reordered Q4_0 MMVQ for Intel GPUs [labels: ggml, SYCL]
#12858 opened Apr 10, 2025 by Alcpz
2 of 3 tasks
gguf-py: byteswapping improvements [labels: python]
#12851 opened Apr 9, 2025 by AlekseiNikiforovIBM
metal : add memory pool for temp allocs [labels: Apple Metal, ggml]
#12850 opened Apr 9, 2025 by ggerganov
9 tasks done