Actions: ggml-org/llama.cpp

EditorConfig Checker

13,531 workflow runs

SYCL: using graphs is configurable by environment variable and compile option
EditorConfig Checker #22785: Pull request #12371 synchronize by lslusarczyk
March 14, 2025 11:04 Action required lslusarczyk:sycl-graphs
llama : add llama_batch_ext
EditorConfig Checker #22784: Pull request #11875 synchronize by ngxson
March 14, 2025 10:28 23s ngxson:xsn/private_batch_api
llama : add llama_batch_ext
EditorConfig Checker #22783: Pull request #11875 synchronize by ngxson
March 14, 2025 10:25 2m 0s ngxson:xsn/private_batch_api
server: fix "--grammar-file" parameter (#12285)
EditorConfig Checker #22782: Commit add2a3a pushed by ngxson
March 14, 2025 10:21 3m 46s master
llama : add llama_batch_ext
EditorConfig Checker #22781: Pull request #11875 synchronize by ngxson
March 14, 2025 09:47 59s ngxson:xsn/private_batch_api
[WIP] MUSA: enable fastfp16, correct warp reduce impl and perf tuning
EditorConfig Checker #22780: Pull request #12383 opened by BodhiHu
March 14, 2025 09:36 11m 13s BodhiHu:musa
Load all MoE experts during warmup
EditorConfig Checker #22779: Pull request #11571 synchronize by fairydreaming
March 14, 2025 09:30 2m 31s fairydreaming:experts-warmup
Load all MoE experts during warmup
EditorConfig Checker #22778: Pull request #11571 reopened by fairydreaming
March 14, 2025 09:19 2m 50s fairydreaming:experts-warmup
Load all MoE experts during warmup
EditorConfig Checker #22777: Pull request #11571 synchronize by fairydreaming
March 14, 2025 09:19 26s fairydreaming:experts-warmup
ggml : fix quantized cpy op
EditorConfig Checker #22776: Pull request #12310 synchronize by ggerganov
March 14, 2025 09:00 2m 14s gg/cpu-fix-cpy-q
graph : simplify attn input build for unified KV cache (#12381)
EditorConfig Checker #22775: Commit c522ce4 pushed by ggerganov
March 14, 2025 08:47 19s master
[CANN]MUL_MAT optimization
EditorConfig Checker #22774: Pull request #12382 synchronize by noemotiovon
March 14, 2025 08:10 19s noemotiovon:mat_mul
[CANN]MUL_MAT optimization
EditorConfig Checker #22773: Pull request #12382 opened by noemotiovon
March 14, 2025 08:03 16s noemotiovon:mat_mul
graph : simplify attn input build for unified KV cache
EditorConfig Checker #22772: Pull request #12381 opened by ggerganov
March 14, 2025 07:06 15m 28s gg/graph-simplify-attn-inp
hparams : add SWA rope parameters (#12374)
EditorConfig Checker #22771: Commit 081bee8 pushed by ggerganov
March 14, 2025 07:03 16s master
server: streaming of tool calls and thoughts when --jinja is on
EditorConfig Checker #22770: Pull request #12379 opened by ochafik
March 14, 2025 04:45 20s ochafik:tool-diffs
[WIP]backend: Integrating QNN (Qualcomm AI Engine Direct) as a dedicated backend for Qualcomm NPUs
EditorConfig Checker #22769: Pull request #12063 synchronize by chraac
March 14, 2025 02:13 Action required chraac:dev-refactoring
llama : add llama_batch_ext
EditorConfig Checker #22768: Pull request #11875 synchronize by ngxson
March 13, 2025 23:22 19s ngxson:xsn/private_batch_api
Add CLI arg to llama-run to adjust the number of threads used
EditorConfig Checker #22767: Pull request #12370 synchronize by ericcurtin
March 13, 2025 22:36 16s llama-run-n-threads
llama : add llama_batch_ext
EditorConfig Checker #22766: Pull request #11875 synchronize by ngxson
March 13, 2025 22:14 21s ngxson:xsn/private_batch_api
llama : add llama_batch_ext
EditorConfig Checker #22765: Pull request #11875 synchronize by ngxson
March 13, 2025 22:09 16s ngxson:xsn/private_batch_api
llama : add llama_batch_ext
EditorConfig Checker #22764: Pull request #11875 synchronize by ngxson
March 13, 2025 21:56 15s ngxson:xsn/private_batch_api
llama : add llama_batch_ext
EditorConfig Checker #22763: Pull request #11875 synchronize by ngxson
March 13, 2025 21:38 22s ngxson:xsn/private_batch_api
llama : add llama_batch_ext
EditorConfig Checker #22762: Pull request #11875 synchronize by ngxson
March 13, 2025 21:36 16s ngxson:xsn/private_batch_api