Skip to content

Actions: ggml-org/llama.cpp

CI

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
11,336 workflow runs
11,336 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

SYCL: Delete redundant plus sign and space
CI #20322: Pull request #12391 opened by aubreyli
March 14, 2025 14:41 48m 37s aubreyli:master
March 14, 2025 14:41 48m 37s
SYCL: set extras only on GGML_TYPE_Q4_0
CI #20321: Pull request #12366 synchronize by qnixsynapse
March 14, 2025 14:39 49m 9s qnixsynapse:fix/memory_leak
March 14, 2025 14:39 49m 9s
server: streaming of tool calls and thoughts when --jinja is on
CI #20320: Pull request #12379 synchronize by ochafik
March 14, 2025 13:05 1h 9m 17s ochafik:tool-diffs
March 14, 2025 13:05 1h 9m 17s
Load all MoE experts during warmup (#11571)
CI #20318: Commit 8fcb563 pushed by fairydreaming
March 14, 2025 12:47 1h 3m 31s master
March 14, 2025 12:47 1h 3m 31s
server: streaming of tool calls and thoughts when --jinja is on
CI #20315: Pull request #12379 synchronize by ochafik
March 14, 2025 12:07 54m 51s ochafik:tool-diffs
March 14, 2025 12:07 54m 51s
llama : add llama_batch_ext
CI #20309: Pull request #11875 synchronize by ngxson
March 14, 2025 10:28 1h 50m 39s ngxson:xsn/private_batch_api
March 14, 2025 10:28 1h 50m 39s
llama : add llama_batch_ext
CI #20308: Pull request #11875 synchronize by ngxson
March 14, 2025 10:25 3m 2s ngxson:xsn/private_batch_api
March 14, 2025 10:25 3m 2s
server: fix "--grammar-file" parameter (#12285)
CI #20307: Commit add2a3a pushed by ngxson
March 14, 2025 10:21 1h 39m 5s master
March 14, 2025 10:21 1h 39m 5s
llama : add llama_batch_ext
CI #20306: Pull request #11875 synchronize by ngxson
March 14, 2025 09:47 38m 40s ngxson:xsn/private_batch_api
March 14, 2025 09:47 38m 40s
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
CI #20305: Pull request #12383 opened by BodhiHu
March 14, 2025 09:36 1h 19m 54s BodhiHu:musa
March 14, 2025 09:36 1h 19m 54s
Load all MoE experts during warmup
CI #20304: Pull request #11571 synchronize by fairydreaming
March 14, 2025 09:30 1h 55m 17s fairydreaming:experts-warmup
March 14, 2025 09:30 1h 55m 17s
Load all MoE experts during warmup
CI #20302: Pull request #11571 synchronize by fairydreaming
March 14, 2025 09:19 59m 15s fairydreaming:experts-warmup
March 14, 2025 09:19 59m 15s
ggml : fix quantized cpy op
CI #20301: Pull request #12310 synchronize by ggerganov
March 14, 2025 09:00 51m 31s gg/cpu-fix-cpy-q
March 14, 2025 09:00 51m 31s
graph : simplify attn input build for unified KV cache (#12381)
CI #20300: Commit c522ce4 pushed by ggerganov
March 14, 2025 08:47 45m 26s master
March 14, 2025 08:47 45m 26s
[CANN]MUL_MAT optimization
CI #20299: Pull request #12382 synchronize by noemotiovon
March 14, 2025 08:10 45m 58s noemotiovon:mat_mul
March 14, 2025 08:10 45m 58s