Skip to content

Actions: ggml-org/llama.cpp

Pull Request Labeler

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
8,954 workflow runs
8,954 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

CUDA/HIP: Fix fattn-vec-* when device warp size is not 32
Pull Request Labeler #8905: Pull request #12315 opened by IMbackK
March 10, 2025 18:36 14s
March 10, 2025 18:36 14s
DeepSeek V2/V3 implementation refactored to allow non-MLA and MLA
Pull Request Labeler #8904: Pull request #12313 synchronize by jukofyork
March 10, 2025 17:46 13m 38s
March 10, 2025 17:46 13m 38s
ggml : fix quantized cpy op
Pull Request Labeler #8903: Pull request #12310 synchronize by ggerganov
March 10, 2025 17:44 3m 12s
March 10, 2025 17:44 3m 12s
DeepSeek V2/V3 implementation refactored to allow non-MLA and MLA
Pull Request Labeler #8902: Pull request #12313 synchronize by jukofyork
March 10, 2025 17:41 19s
March 10, 2025 17:41 19s
DeepSeek V2/V3 implementation refactored to allow non-MLA and MLA
Pull Request Labeler #8901: Pull request #12313 opened by jukofyork
March 10, 2025 17:30 4m 51s
March 10, 2025 17:30 4m 51s
vulkan: Add N/2 and N/4 optimized paths in coopmat2 shader
Pull Request Labeler #8900: Pull request #12312 opened by jeffbolznv
March 10, 2025 16:49 17s
March 10, 2025 16:49 17s
tool-call: add support for tool-calls using Model Context Protocol
Pull Request Labeler #8899: Pull request #11556 synchronize by bandoti
March 10, 2025 14:59 12s
March 10, 2025 14:59 12s
tool-call: add support for tool-calls using Model Context Protocol
Pull Request Labeler #8898: Pull request #11556 synchronize by bandoti
March 10, 2025 14:46 13s
March 10, 2025 14:46 13s
readme: added Sidekick to available UIs
Pull Request Labeler #8897: Pull request #12311 opened by johnbean393
March 10, 2025 13:59 25m 56s
March 10, 2025 13:59 25m 56s
ggml : fix quantized cpy op
Pull Request Labeler #8896: Pull request #12310 opened by ggerganov
March 10, 2025 13:48 27m 5s
March 10, 2025 13:48 27m 5s
build: build llama.cpp + ggml-qnn in pure command line mode on x86-64 Windows
Pull Request Labeler #8895: Pull request #12215 synchronize by zhouwg
March 10, 2025 13:46 11m 11s
March 10, 2025 13:46 11m 11s
vulkan: use fp32 in coopmat2 q4_k dequant function
Pull Request Labeler #8894: Pull request #12309 opened by jeffbolznv
March 10, 2025 13:45 18s
March 10, 2025 13:45 18s
tests : fix test-quantize-fns to init the CPU backend
Pull Request Labeler #8893: Pull request #12306 opened by ggerganov
March 10, 2025 11:46 16m 0s
March 10, 2025 11:46 16m 0s
server: extract <think> tags from qwq outputs
Pull Request Labeler #8892: Pull request #12297 synchronize by ochafik
March 10, 2025 10:58 18s
March 10, 2025 10:58 18s
server: extract <think> tags from qwq outputs
Pull Request Labeler #8891: Pull request #12297 synchronize by ochafik
March 10, 2025 09:49 50m 14s
March 10, 2025 09:49 50m 14s
sampler: fixes trigger tokens + lazy grammars (fix typo cast from token to string)
Pull Request Labeler #8890: Pull request #12291 synchronize by ochafik
March 10, 2025 09:40 21s
March 10, 2025 09:40 21s
Vulkan: Add DP4A MMQ and Q8_1 quantization shader
Pull Request Labeler #8889: Pull request #12135 synchronize by 0cc4m
March 10, 2025 08:11 15s
March 10, 2025 08:11 15s
Update build.yml for Windows Vulkan builder to use Vulkan 1.4.304 SDK…
Pull Request Labeler #8888: Pull request #12301 opened by oscarbg
March 10, 2025 07:29 13s
March 10, 2025 07:29 13s
fix bug in minicpm-v code
Pull Request Labeler #8887: Pull request #11513 synchronize by tc-mb
March 10, 2025 06:22 19s
March 10, 2025 06:22 19s
B4735 standalone itt
Pull Request Labeler #8886: Pull request #12300 opened by rillomas
March 10, 2025 05:37 2m 6s
March 10, 2025 05:37 2m 6s
metal: Cache compiled library at device level
Pull Request Labeler #8885: Pull request #12265 synchronize by BB-fat
March 10, 2025 05:33 14s
March 10, 2025 05:33 14s
webui: Stop rerender on textarea input and end the devastating lag
Pull Request Labeler #8884: Pull request #12299 opened by woof-dog
March 10, 2025 04:24 13s
March 10, 2025 04:24 13s
server: extract <think> tags from qwq outputs
Pull Request Labeler #8883: Pull request #12297 opened by ochafik
March 10, 2025 02:09 23s
March 10, 2025 02:09 23s
musa: support new arch mp_31 and update doc
Pull Request Labeler #8882: Pull request #12296 opened by yeahdongcn
March 10, 2025 01:50 16s
March 10, 2025 01:50 16s
tool-call: ensure there's always a non-empty tool call id
Pull Request Labeler #8881: Pull request #12292 synchronize by ochafik
March 10, 2025 00:46 2m 33s
March 10, 2025 00:46 2m 33s