Skip to content

Releases: ggerganov/llama.cpp

b2868

13 May 22:39
948f4ec
Compare
Choose a tag to compare
[SYCL] rm wait() (#7233)

b2867

13 May 14:37
9aa6724
Compare
Choose a tag to compare
llama : rename jina tokenizers to v2 (#7249)

* refactor: rename jina tokenizers to v2

* refactor: keep refactoring non-breaking

b2865

13 May 03:25
e586ee4
Compare
Choose a tag to compare
change default temperature of OAI compat API from 0 to 1 (#7226)

* change default temperature of OAI compat API from 0 to 1

* make tests explicitly send temperature to OAI API

b2864

13 May 01:17
cbf7589
Compare
Choose a tag to compare
[SYCL] Add oneapi runtime dll files to win release package (#7241)

* add oneapi running time dlls to release package

* fix path

* fix path

* fix path

* fix path

* fix path

---------

Co-authored-by: Zhang <[email protected]>

b2862

12 May 21:45
dc685be
Compare
Choose a tag to compare
CUDA: add FP32 FlashAttention vector kernel (#7188)

* CUDA: add FP32 FlashAttention vector kernel

* fixup! CUDA: add FP32 FlashAttention vector kernel

* fixup! fixup! CUDA: add FP32 FlashAttention vector kernel

* fixup! fixup! fixup! CUDA: add FP32 FlashAttention vector kernel

b2861

12 May 16:33
6f1b636
Compare
Choose a tag to compare
cmake : fix version cmp (#7227)

b2860

12 May 01:41
b228aba
Compare
Choose a tag to compare
remove convert-lora-to-ggml.py (#7204)

b2859

11 May 19:17
7bd4ffb
Compare
Choose a tag to compare
metal : fix warnings (skipme) (#0)

b2854

11 May 16:12
72c177c
Compare
Choose a tag to compare
fix system prompt handling (#7153)

b2852

11 May 13:34
Compare
Choose a tag to compare
sync : ggml

ggml-ci