feat: use llama cpp server #350

vansangpfiev · 2024-12-26T23:40:25Z

For vision models, the application utilizes a dedicated, customized server that runs within the same process as the main application.
To handle text and embedding models, the application spawns a separate child process for each model.

…feat/use-llama-cpp-server

…-llama-cpp-server

sangjanai added 3 commits December 26, 2024 14:40

feat: using llama.cpp server

30042d8

feat: use llama.cpp server linux

454ac86

fix: add patch

4c1a26f

vansangpfiev force-pushed the feat/use-llama-cpp-server branch 11 times, most recently from 1d900b3 to 48d0455 Compare December 27, 2024 06:34

fix: CI

b1fe9a9

vansangpfiev force-pushed the feat/use-llama-cpp-server branch from 48d0455 to b1fe9a9 Compare December 27, 2024 06:57

sangjanai added 3 commits December 27, 2024 14:52

chore: e2e tests

12971db

fix: support stream_options

6859bff

chore: verify openai api compatibility

b4db561

vansangpfiev force-pushed the feat/use-llama-cpp-server branch from 9020e46 to 7cb0706 Compare December 31, 2024 04:00

chore: cleanup

051e9d6

vansangpfiev force-pushed the feat/use-llama-cpp-server branch from 7cb0706 to 051e9d6 Compare December 31, 2024 06:29

sangjanai added 6 commits December 31, 2024 15:52

Merge branch 'main' of https://github.com/janhq/cortex.llamacpp into …

3274bc7

…feat/use-llama-cpp-server

chore: enable build

e6c4218

Merge branch 'main' of github.com:janhq/cortex.llamacpp into feat/use…

c4c16ed

…-llama-cpp-server

chore: update patch

7aa8135

chore: e2e

3b93f74

fix: build macos

04d90b3

vansangpfiev marked this pull request as ready for review January 2, 2025 02:13

chore: add docs

aefc495

vansangpfiev force-pushed the feat/use-llama-cpp-server branch from 8a9f42d to aefc495 Compare January 2, 2025 07:08

fix: test with cortex.cpp

1903170

vansangpfiev requested review from namchuai and nguyenhoangthuan99 January 3, 2025 02:37

Merge branch 'main' into feat/use-llama-cpp-server

7bbc7fe

Provide feedback