Ettore Di Giacinto
3fcfaec7c8
chore(ci): move also other jobs to public runner ( #5683 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 22:00:12 +02:00
Ettore Di Giacinto
a463d40a3e
chore(ci): try to use public runners also for release builds ( #5681 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 21:51:54 +02:00
Ettore Di Giacinto
1e1f0ee321
chore(backends): move bark-cpp to the backend gallery ( #5682 )
...
chore(bark-cpp): move outside from binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-18 19:48:50 +02:00
Ettore Di Giacinto
80b3139fa0
Update landing.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-18 19:48:17 +02:00
LocalAI [bot]
5173d37acb
chore: ⬆️ Update ggml-org/llama.cpp to 860a9e4eeff3eb2e7bd1cc38f65787cc6c8177af ( #5678 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:46 +02:00
LocalAI [bot]
470e48a900
chore: ⬆️ Update ggml-org/whisper.cpp to f3ff80ea8da044e5b8833e7ba54ee174504c518d ( #5677 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-18 10:01:08 +02:00
Ettore Di Giacinto
b706dddc93
chore(ci): switch to public runners for base images ( #5680 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:38:50 +02:00
Ettore Di Giacinto
867db3f888
chore(docs): add backend url
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:35:21 +02:00
Ettore Di Giacinto
b79aa31398
chore: move backends docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:26:40 +02:00
Ettore Di Giacinto
fb9a09d49c
chore(backend gallery): add description for remaining backends ( #5679 )
...
* chore(backend gallery): add description for remaining backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(backend gallery): add linter
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 22:21:44 +02:00
Ettore Di Giacinto
0a78f0ad2d
chore(backend gallery): re-order and add description for vLLM ( #5676 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:31:53 +02:00
Ettore Di Giacinto
d68660bd5a
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b ( #5665 )
...
chore(deps): bump llama.cpp to 'e434e69183fd9e1031f4445002083178c331a28b'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-17 17:00:10 +02:00
LocalAI [bot]
30ceee2dec
chore: ⬆️ Update ggml-org/whisper.cpp to 2a4d6db7d90899aff3d58d70996916968e4e0d27 ( #5661 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-17 09:21:05 +02:00
dependabot[bot]
18c38335fc
chore(deps): bump securego/gosec from 2.22.4 to 2.22.5 ( #5663 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.4 to 2.22.5.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.4...v2.22.5 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.5
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-06-16 23:12:27 +00:00
Ettore Di Giacinto
89040ff6f7
fix: add python symlink, use absolute python env path when running backends ( #5664 )
...
* fix: add python symlink, use absolute python env path when running backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(ci): do not push images when building PRs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 23:00:53 +02:00
Ettore Di Giacinto
de343700fd
Don't run python_backend workflow on PR
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:06:56 +02:00
Ettore Di Giacinto
87d18ad951
chore: Add python3 to images ( #5660 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-16 11:05:44 +02:00
Ettore Di Giacinto
912c8eff04
chore(ci): use public runner for extra backends ( #5657 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-16 08:21:18 +02:00
LocalAI [bot]
481f30bde8
chore: ⬆️ Update ggml-org/llama.cpp to 30e5b01de2a0bcddc7c063c8ef0802703a958417 ( #5659 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 23:03:40 +00:00
Ettore Di Giacinto
236ac30252
chore(ci): do not specify image-type anymore
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 17:28:40 +02:00
Ettore Di Giacinto
6f761e62e4
update README
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:06:43 +02:00
FT
1f29b5f38e
Fix Typos and Improve Documentation Clarity ( #5648 )
...
* Update p2p.go
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
* Update GPU-acceleration.md
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
---------
Signed-off-by: FT <140458077+zeevick10@users.noreply.github.com >
2025-06-15 16:04:44 +02:00
LocalAI [bot]
33d702c5e0
chore: ⬆️ Update ggml-org/llama.cpp to 3cb203c89f60483e349f841684173446ed23c28f ( #5644 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:03:13 +02:00
Ettore Di Giacinto
95ff236127
ci: do not fire python_backend on PRs
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 16:02:30 +02:00
Ettore Di Giacinto
2d64269763
feat: Add backend gallery ( #5607 )
...
* feat: Add backend gallery
This PR add support to manage backends as similar to models. There is
now available a backend gallery which can be used to install and remove
extra backends.
The backend gallery can be configured similarly as a model gallery, and
API calls allows to install and remove new backends in runtime, and as
well during the startup phase of LocalAI.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backends docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* wip: Backend Dockerfile for python backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat: drop extras images, build python backends separately
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixup on all backends
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Tweaks
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop old backends leftovers
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixup CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Move dockerfile upper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix proto
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Feature dropped for consistency - we prefer model galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add missing packages in the build image
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* exllama is ponly available on cublas
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* pin torch on chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Debug CI
* Install accellerators deps
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add target arch
* Add cuda minor version
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use self-hosted runners
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: use quay for test images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups for vllm and chatterbox
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Small fixups on CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chatterbox is only available for nvidia
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify CI builds
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt test, use qwen3
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(model gallery): add jina-reranker-v1-tiny-en-gguf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(gguf-parser): recover from potential panics that can happen while reading ggufs with gguf-parser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Use reranker from llama.cpp in AIO images
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Limit concurrent jobs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-15 14:56:52 +02:00
LocalAI [bot]
a7a6020328
chore: ⬆️ Update ggml-org/whisper.cpp to 705db0f728310c32bc96f4e355e2b18076932f75 ( #5643 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-15 08:39:00 +02:00
Ettore Di Giacinto
40618164b2
chore: improve tests ( #5646 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-14 10:07:05 +02:00
fuder.eth
eb8c29f90a
Minor Documentation Updates: Clarified Comments in Python and Go Files ( #5641 )
...
* Update ui.go
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
* Update backend.py
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
---------
Signed-off-by: fuder.eth <139509124+vtjl10@users.noreply.github.com >
2025-06-13 19:55:25 +02:00
Gavin Mogan
63116a2c6a
docs: Update docs metadata headers so when mentioned on slack it doesn't say hugo ( #5642 )
...
Update docs metadata headers so when mentioned on slack it doesn't say hugo
Signed-off-by: Gavin Mogan <github@gavinmogan.com >
2025-06-13 19:54:57 +02:00
LocalAI [bot]
311c2cf539
chore: ⬆️ Update ggml-org/llama.cpp to ed52f3668e633423054a4eab61bb7efee47025ab ( #5636 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-12 23:33:33 +02:00
Ettore Di Giacinto
a6fcbd991d
chore(model gallery): add yanfei-v2-qwen3-32b ( #5639 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-12 22:24:13 +02:00
kilavvy
2e1dc8deef
Fix Typos in Comments and Error Messages ( #5637 )
...
* Update initializers.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
* Update base.go
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
---------
Signed-off-by: kilavvy <140459108+kilavvy@users.noreply.github.com >
2025-06-12 18:34:32 +02:00
LocalAI [bot]
282e017b22
chore: ⬆️ Update ggml-org/whisper.cpp to ebbc874e85b518f963a87612f6d79f5c71a55e84 ( #5635 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 23:47:00 +02:00
Ettore Di Giacinto
f86cb8be2d
chore(model gallery): add qwen3-embedding-0.6b ( #5634 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:40:41 +02:00
Ettore Di Giacinto
5c56ec4f87
chore(model gallery): add qwen3-embedding-8b ( #5633 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:38:44 +02:00
Ettore Di Giacinto
dd2845a034
chore(model gallery): add qwen3-embedding-4b ( #5632 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:31:43 +02:00
Ettore Di Giacinto
2e7db014b6
chore(model gallery): add openbuddy_openbuddy-r1-0528-distill-qwen3-32b-preview0-qat ( #5631 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:27:30 +02:00
Ettore Di Giacinto
6faeee1d92
chore(model gallery): add baai_robobrain2.0-7b ( #5630 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:17:32 +02:00
Ettore Di Giacinto
31d73eb934
chore(model gallery): add mistralai_magistral-small-2506 ( #5629 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:11:44 +02:00
Ettore Di Giacinto
60863b9e52
chore(model gallery): add sophosympatheia_strawberrylemonade-l3-70b-v1.0 ( #5628 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:08:17 +02:00
Ettore Di Giacinto
a9fc71e2f3
chore(model gallery): add kwaipilot_kwaicoder-autothink-preview ( #5627 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-11 11:06:38 +02:00
leopardracer
ce9a9a30e0
Improve Comments and Documentation for MixedMode and ParseJSON Functions ( #5626 )
...
Update parse.go
Signed-off-by: leopardracer <136604165+leopardracer@users.noreply.github.com >
2025-06-11 09:46:53 +02:00
LocalAI [bot]
2693a21da5
chore: ⬆️ Update ggml-org/whisper.cpp to 2679bec6e09231c6fd59715fcba3eebc9e2f6076 ( #5625 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:35:28 +02:00
LocalAI [bot]
d460eab18e
chore: ⬆️ Update ggml-org/llama.cpp to 3678b838bb71eaccbaeb479ff38c2e12bfd2f960 ( #5620 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-11 09:00:39 +02:00
LocalAI [bot]
c61e5fe266
chore: ⬆️ Update ggml-org/whisper.cpp to d78f08142381c1460604713e2f2ddf3331c7d816 ( #5619 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-06-10 17:29:58 +02:00
Ettore Di Giacinto
88e570b5de
fix(deps): pin grpcio ( #5621 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-10 14:21:51 +02:00
Ettore Di Giacinto
6efa97ce0b
chore(model gallery): add qwen2.5-omni-3b ( #5606 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-06-09 10:54:42 +02:00
LocalAI [bot]
41cde5468a
chore: ⬆️ Update ggml-org/llama.cpp to 247e5c6e447707bb4539bdf1913d206088a8fc69 ( #5605 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-09 00:11:46 +02:00
Richard Palethorpe
d650647db9
fix(realtime): Use updated model on session update ( #5604 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-06-09 00:11:05 +02:00
LocalAI [bot]
5bc7ef37a2
chore: ⬆️ Update ggml-org/llama.cpp to 5787b5da57e54dba760c2deeac1edf892e8fc450 ( #5601 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-06-08 08:44:24 +02:00