Ettore Di Giacinto
95ff236127
ci: do not fire python_backend on PRs
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:02:30 +02:00
Ettore Di Giacinto
2d64269763
feat: Add backend gallery ( #5607 )
...
* feat: Add backend gallery
This PR add support to manage backends as similar to models. There is
now available a backend gallery which can be used to install and remove
extra backends.
The backend gallery can be configured similarly as a model gallery, and
API calls allows to install and remove new backends in runtime, and as
well during the startup phase of LocalAI.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backends docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wip: Backend Dockerfile for python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: drop extras images, build python backends separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixup on all backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Tweaks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop old backends leftovers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move dockerfile upper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix proto
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Feature dropped for consistency - we prefer model galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add missing packages in the build image
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* exllama is ponly available on cublas
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pin torch on chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Debug CI
* Install accellerators deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add target arch
* Add cuda minor version
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted runners
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: use quay for test images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups for vllm and chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups on CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chatterbox is only available for nvidia
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify CI builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt test, use qwen3
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use reranker from llama.cpp in AIO images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Limit concurrent jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 14:56:52 +02:00
LocalAI [bot]
a7a6020328
chore: ⬆️ Update ggml-org/whisper.cpp to 705db0f728310c32bc96f4e355e2b18076932f75 ( #5643 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 08:39:00 +02:00
Ettore Di Giacinto
40618164b2
chore: improve tests ( #5646 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-14 10:07:05 +02:00
fuder.eth
eb8c29f90a
Minor Documentation Updates: Clarified Comments in Python and Go Files ( #5641 )
...
* Update ui.go
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
* Update backend.py
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
---------
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
2025-06-13 19:55:25 +02:00
Gavin Mogan
63116a2c6a
docs: Update docs metadata headers so when mentioned on slack it doesn't say hugo ( #5642 )
...
Update docs metadata headers so when mentioned on slack it doesn't say hugo
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-13 19:54:57 +02:00
LocalAI [bot]
311c2cf539
chore: ⬆️ Update ggml-org/llama.cpp to ed52f3668e633423054a4eab61bb7efee47025ab ( #5636 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-12 23:33:33 +02:00
Ettore Di Giacinto
a6fcbd991d
chore(model gallery): add yanfei-v2-qwen3-32b ( #5639 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-12 22:24:13 +02:00
kilavvy
2e1dc8deef
Fix Typos in Comments and Error Messages ( #5637 )
...
* Update initializers.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update base.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-12 18:34:32 +02:00
LocalAI [bot]
282e017b22
chore: ⬆️ Update ggml-org/whisper.cpp to ebbc874e85b518f963a87612f6d79f5c71a55e84 ( #5635 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 23:47:00 +02:00
Ettore Di Giacinto
f86cb8be2d
chore(model gallery): add qwen3-embedding-0.6b ( #5634 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:40:41 +02:00
Ettore Di Giacinto
5c56ec4f87
chore(model gallery): add qwen3-embedding-8b ( #5633 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:38:44 +02:00
Ettore Di Giacinto
dd2845a034
chore(model gallery): add qwen3-embedding-4b ( #5632 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:31:43 +02:00
Ettore Di Giacinto
2e7db014b6
chore(model gallery): add openbuddy_openbuddy-r1-0528-distill-qwen3-32b-preview0-qat ( #5631 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:27:30 +02:00
Ettore Di Giacinto
6faeee1d92
chore(model gallery): add baai_robobrain2.0-7b ( #5630 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:17:32 +02:00
Ettore Di Giacinto
31d73eb934
chore(model gallery): add mistralai_magistral-small-2506 ( #5629 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:11:44 +02:00
Ettore Di Giacinto
60863b9e52
chore(model gallery): add sophosympatheia_strawberrylemonade-l3-70b-v1.0 ( #5628 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:08:17 +02:00
Ettore Di Giacinto
a9fc71e2f3
chore(model gallery): add kwaipilot_kwaicoder-autothink-preview ( #5627 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:06:38 +02:00
leopardracer
ce9a9a30e0
Improve Comments and Documentation for MixedMode and ParseJSON Functions ( #5626 )
...
Update parse.go
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-11 09:46:53 +02:00
LocalAI [bot]
2693a21da5
chore: ⬆️ Update ggml-org/whisper.cpp to 2679bec6e09231c6fd59715fcba3eebc9e2f6076 ( #5625 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:35:28 +02:00
LocalAI [bot]
d460eab18e
chore: ⬆️ Update ggml-org/llama.cpp to 3678b838bb71eaccbaeb479ff38c2e12bfd2f960 ( #5620 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:00:39 +02:00
LocalAI [bot]
c61e5fe266
chore: ⬆️ Update ggml-org/whisper.cpp to d78f08142381c1460604713e2f2ddf3331c7d816 ( #5619 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-10 17:29:58 +02:00
Ettore Di Giacinto
88e570b5de
fix(deps): pin grpcio ( #5621 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-10 14:21:51 +02:00
Ettore Di Giacinto
6efa97ce0b
chore(model gallery): add qwen2.5-omni-3b ( #5606 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-09 10:54:42 +02:00
LocalAI [bot]
41cde5468a
chore: ⬆️ Update ggml-org/llama.cpp to 247e5c6e447707bb4539bdf1913d206088a8fc69 ( #5605 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-09 00:11:46 +02:00
Richard Palethorpe
d650647db9
fix(realtime): Use updated model on session update ( #5604 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-06-09 00:11:05 +02:00
LocalAI [bot]
5bc7ef37a2
chore: ⬆️ Update ggml-org/llama.cpp to 5787b5da57e54dba760c2deeac1edf892e8fc450 ( #5601 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-08 08:44:24 +02:00
Ettore Di Giacinto
e0a52807c8
chore(model gallery): add akhil-theerthala_kuvera-8b-v0.1.0 ( #5600 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-07 08:59:20 +02:00
LocalAI [bot]
1a95a19f87
chore: ⬆️ Update ggml-org/llama.cpp to 745aa5319b9930068aff5e87cf5e9eef7227339b ( #5598 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:59:05 +02:00
LocalAI [bot]
bcfc08e5bf
chore: ⬆️ Update ggml-org/whisper.cpp to b175baa665bc35f97a2ca774174f07dfffb84e19 ( #5597 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-07 08:57:52 +02:00
Ettore Di Giacinto
4d282ca963
chore(model gallery): add nbeerbower_qwen3-gutenberg-encore-14b ( #5596 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:20:48 +02:00
Ettore Di Giacinto
525f49b69d
chore(model gallery): add open-thoughts_openthinker3-7b ( #5595 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-06 10:14:00 +02:00
LocalAI [bot]
786aa1de05
chore: ⬆️ Update ggml-org/llama.cpp to 1caae7fc6c77551cb1066515e0f414713eebb367 ( #5593 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-06 00:10:02 +02:00
Ettore Di Giacinto
ea82deb16b
chore(model gallery): add ultravox-v0_5-llama-3_1-8b ( #5592 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:23:51 +02:00
Ettore Di Giacinto
b0891309ba
chore(model gallery): add ultravox-v0_5-llama-3_2-1b ( #5591 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:22:01 +02:00
Ettore Di Giacinto
b034cff149
feat: improve RAM estimation by using values from summary ( #5525 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:26 +02:00
Ettore Di Giacinto
432f34f001
chore(model gallery): add goekdeniz-guelmez_josiefied-qwen3-14b-abliterated-v3 ( #5590 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-05 19:16:04 +02:00
Gavin Mogan
cbd61dccd4
fix(install.sh): vulkan docker tag ( #5589 )
...
vulkan docker tag is not prefixed with gpu
```
regctl tag ls localai/localai | grep 2.29 | grep vulkan
v2.29.0-vulkan
```
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-05 08:12:16 +02:00
LocalAI [bot]
0de0817d71
chore: ⬆️ Update ggml-org/whisper.cpp to 799eacdde40b3c562cfce1508da1354b90567f8f ( #5586 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:38 +02:00
LocalAI [bot]
bf57d6e5ac
chore: ⬆️ Update ggml-org/llama.cpp to 0d3984424f2973c49c4bcabe4cc0153b4f90c601 ( #5585 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-05 08:11:12 +02:00
Ettore Di Giacinto
0b9603e010
chore(model gallery): add deepseek-ai_deepseek-r1-0528-qwen3-8b ( #5580 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 15:28:45 +02:00
Ettore Di Giacinto
8d925217f6
chore(model gallery): add e-n-v-y_legion-v2.1-llama-70b-elarablated-v0.8-hf ( #5579 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:12:37 +02:00
Ettore Di Giacinto
669a1ccae6
chore(model gallery): add nvidia_nemotron-research-reasoning-qwen-1.5b ( #5578 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 11:07:10 +02:00
Ettore Di Giacinto
7a7d36ad63
chore(model gallery): add arcee-ai_homunculus ( #5577 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 10:02:15 +02:00
Ettore Di Giacinto
8b889955b4
chore(deps): bump pytorch to 2.7 in vllm ( #5576 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-04 08:56:45 +02:00
dependabot[bot]
a226555949
chore(deps): bump GrantBirki/git-diff-action from 2.8.0 to 2.8.1 ( #5564 )
...
Bumps [GrantBirki/git-diff-action](https://github.com/grantbirki/git-diff-action ) from 2.8.0 to 2.8.1.
- [Release notes](https://github.com/grantbirki/git-diff-action/releases )
- [Commits](https://github.com/grantbirki/git-diff-action/compare/v2.8.0...v2.8.1 )
---
updated-dependencies:
- dependency-name: GrantBirki/git-diff-action
dependency-version: 2.8.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-04 08:41:47 +02:00
LocalAI [bot]
f38f17865a
chore: ⬆️ Update ggml-org/whisper.cpp to 82f461eaa4e6a1ba29fc0dbdaa415a9934ee8a1d ( #5575 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:41:26 +02:00
LocalAI [bot]
03f380701b
chore: ⬆️ Update ggml-org/llama.cpp to 7e00e60ef86645a01fda738fef85b74afa016a34 ( #5574 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-04 08:26:36 +02:00
Ettore Di Giacinto
65e2866c97
fix(chatterbox): install only with cuda 12 ( #5573 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 14:57:47 +02:00
Ettore Di Giacinto
cd3cd899ad
chore(deps): bump llama.cpp to '363757628848a27a435bbf22ff9476e9aeda5f40' ( #5571 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-03 12:19:16 +02:00