Pull requests: ggml-org/llama.cpp
ggml: backward pass for split swiglu
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
#14483
opened Jul 1, 2025 by
JohannesGaessler
Callback before abort
ggml
changes relating to the ggml tensor library for machine learning
#14481
opened Jul 1, 2025 by
ScaledLizard
opencl : add GELU_ERF
ggml
changes relating to the ggml tensor library for machine learning
#14476
opened Jul 1, 2025 by
CISC
CUDA: add softmax broadcast
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
#14475
opened Jul 1, 2025 by
am17an
server : (webui) let server send locally-defined default webui settings
examples
server
#14468
opened Jun 30, 2025 by
woof-dog
Chore: batch prompts, extract tensors specific layer
examples
#14463
opened Jun 30, 2025 by
VakantieModus
convert : correct gemma 3n conversion
python
python script changes
#14450
opened Jun 29, 2025 by
ngxson
Pr/7191
build
Compilation issues
devops
improvements to build systems and github actions
python
python script changes
#14447
opened Jun 29, 2025 by
esrakorkmz
ggml : implement GEGLU_ERF and GEGLU_QUICK ops
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
Vulkan
Issues specific to the Vulkan backend
#14445
opened Jun 29, 2025 by
CISC
Added CI with RISC-V RVV1.0 Hardware
devops
improvements to build systems and github actions
#14439
opened Jun 29, 2025 by
alitariq4589
ggml : support broadcast for ggml_soft_max_ext and ggml_flash_attn_ext
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
Nvidia GPU
Issues specific to Nvidia GPUs
SYCL
https://en.wikipedia.org/wiki/SYCL - GPU programming language
testing
Everything test related
Vulkan
Issues specific to the Vulkan backend
#14435
opened Jun 28, 2025 by
ggerganov
3 of 5 tasks
model : add hunyuan moe
python
python script changes
#14425
opened Jun 27, 2025 by
ngxson
4 tasks done
ggml : add ggml_scale_bias
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
ggml
changes relating to the ggml tensor library for machine learning
testing
Everything test related
[CANN] weight format to nz for Ascend310P3
Ascend NPU
issues specific to Ascend NPUs
ggml
changes relating to the ggml tensor library for machine learning
#14407
opened Jun 27, 2025 by
tqgy6
OpenCL: add conv2d kernel
ggml
changes relating to the ggml tensor library for machine learning
#14403
opened Jun 26, 2025 by
rmatif
ggml : add pointer to attach user data
ggml
changes relating to the ggml tensor library for machine learning
#14397
opened Jun 26, 2025 by
koush
compare-commits.sh: support both llama-bench and test-backend-ops
python
python script changes
script
Script related
#14392
opened Jun 26, 2025 by
yeahdongcn
ggml-cpu: Build variant targeting Neoverse-V2
ggml
changes relating to the ggml tensor library for machine learning
#14380
opened Jun 25, 2025 by
ckastner
webui: preserve partial content when streaming errors occur
examples
server
#14374
opened Jun 25, 2025 by
Aaryan-549
5 of 8 tasks
Q2k interleaving implementation - x86/x64 SIMD
ggml
changes relating to the ggml tensor library for machine learning
#14373
opened Jun 25, 2025 by
Srihari-mcw
test-backend-ops: add support for specifying output format
testing
Everything test related
#14368
opened Jun 25, 2025 by
yeahdongcn
llama : add high-throughput mode
Apple Metal
https://en.wikipedia.org/wiki/Metal_(API)
examples
ggml
changes relating to the ggml tensor library for machine learning