Releases · fairydreaming/llama.cpp

03 Dec 20:55

cc98896

b4255

vulkan: optimize and reenable split_k (#10637)

Use vector loads when possible in mul_mat_split_k_reduce. Use split_k
when there aren't enough workgroups to fill the shaders.

Assets 22

29 Nov 13:17

github-actions

b4219

266b851

b4219

sycl : Reroute permuted mul_mats through oneMKL (#10408)

This PR fixes the failing MUL_MAT tests for the sycl backend.

Assets 22

19 Aug 19:12

github-actions

b3605

cfac111

b3605

cann: add doc for cann backend (#8867)

Co-authored-by: xuedinge233 <[email protected]>
Co-authored-by: hipudding <[email protected]>

Assets 19

09 Jul 20:54

github-actions

b3357

fd560fe

b3357

Update README.md to fix broken link to docs (#8399)

Update the "Performance troubleshooting" doc link to be correct - the file was moved into a dir called 'development'

Assets 20

24 Jun 07:50

github-actions

b3212

8cb508d

b3212

disable publishing the full-rocm docker image (#8083)

Assets 20

21 Jun 12:12

github-actions

b3197

557b653

b3197

vulkan: detect multiple devices by deviceUUID instead of deviceID (#8…

Assets 20

20 Jun 18:39

github-actions

b3190

abd894a

b3190

common: fix warning (#8036)

* common: fix warning

* Update common/common.cpp

Co-authored-by: slaren <[email protected]>

---------

Co-authored-by: slaren <[email protected]>

Assets 20

13 Jun 20:16

github-actions

b3145

172c825

b3145

rpc : fix ggml_backend_rpc_supports_buft() (#7918)

Assets 20

22 May 11:00

github-actions

b2963

95fb0ae

b2963

CUDA: remove incorrect precision check (#7454)

Assets 21

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Releases: fairydreaming/llama.cpp

b4255

b4219

b3605

b3357

b3212

b3197

b3190

b3145

b2963