Skip to content

Releases: ggml-org/llama.cpp

b4874

12 Mar 06:44
bf69cfe
Compare
Choose a tag to compare
vulkan: fix bug in coopmat1 mul_mat_id (#12316)

* tests: run mul_mat_id with a larger N

* vulkan: fix bug in coopmat1 mul_mat_id

b4873

11 Mar 20:00
10f2e81
Compare
Choose a tag to compare
CUDA/HIP: refractor mmqv to unify the calculation of nwarps and rows …

b4872

11 Mar 14:59
ba76543
Compare
Choose a tag to compare
ggml-backend : fix backend search path (#12330)

* Fix backend search path

* replace .native() with '/'

* reverted .native()

b4871

11 Mar 12:36
6ab2e47
Compare
Choose a tag to compare
metal : Cache the Metal library at the device context level (#12265)

b4870

11 Mar 09:08
96e1280
Compare
Choose a tag to compare
clip : bring back GPU support (#12322)

* clip : bring back GPU support

* use n_gpu_layers param

* fix double free

* ggml_backend_init_by_type

* clean up

b4869

10 Mar 20:10
2c9f833
Compare
Choose a tag to compare
mat vec double buffer (#12188)

b4868

10 Mar 18:10
2513645
Compare
Choose a tag to compare
musa: support new arch mp_31 and update doc (#12296)

Signed-off-by: Xiaodong Ye <[email protected]>

b4867

10 Mar 17:59
8acdacb
Compare
Choose a tag to compare
opencl: use OpenCL C standard supported by the device (#12221)

This patch nudges the llama.cpp a bit to be supported on PoCL which
doesn't support OpenCL C CL2.0. The issue is solved by querying the
device for the supported OpenCL C versions and using the highest one
available.

b4865

10 Mar 13:04
e128a1b
Compare
Choose a tag to compare
tests : fix test-quantize-fns to init the CPU backend (#12306)

ggml-ci

b4864

10 Mar 12:46
6ef79a6
Compare
Choose a tag to compare
common : refactor '-o' option (#12278)

As discussed in PR 'llama-tts : add -o option' (#12042):

* common_params : 'out_file' string is the only output file name parameter left in common_params. It's intended to be used in all example programs implementing an '-o' option.

* cvector-generator, export-lora, imatrix : default output filenames moved from 'common_params' to the 'main()' of each example program.