LocalAI [bot]
f7f26b8efa
docs: ⬆️ update docs version mudler/LocalAI ( #6315 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
v3.5.4
2025-09-20 09:41:58 +02:00
LocalAI [bot]
75eb98f8bd
chore: ⬆️ Update ggml-org/llama.cpp to f432d8d83e7407073634c5e4fd81a3d23a10827f ( #6316 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-20 09:41:45 +02:00
LocalAI [bot]
c337e7baf7
chore: ⬆️ Update ggml-org/whisper.cpp to 44fa2f647cf2a6953493b21ab83b50d5f5dbc483 ( #6317 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-19 21:14:10 +00:00
Ettore Di Giacinto
660bd45be8
fix(python): make option check uniform across backends ( #6314 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 19:56:08 +02:00
Ettore Di Giacinto
c27da0a0f6
fix(diffusers): fix float detection ( #6313 )
...
There was apparently an oversight; this fixes the float/int detection.
Fixes: https://github.com/mudler/LocalAI/issues/6312
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
v3.5.3
2025-09-19 19:09:04 +02:00
Ettore Di Giacinto
ac043ed9ba
chore(model gallery): add aquif-3.5-a4b-think ( #6311 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 11:16:50 +02:00
Ettore Di Giacinto
2e0d66a1c8
chore(model gallery): add impish_qwen_14b-1m ( #6310 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 10:57:33 +02:00
Ettore Di Giacinto
41a0f361eb
chore(model gallery): add mistralai_magistral-small-2509 ( #6309 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-19 10:48:13 +02:00
LocalAI [bot]
d3c5c02837
docs: ⬆️ update docs version mudler/LocalAI ( #6307 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-18 23:48:02 +02:00
LocalAI [bot]
ae3d8fb0c4
chore: ⬆️ Update ggml-org/llama.cpp to 3edd87cd055a45d885fa914d879d36d33ecfc3e1 ( #6308 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-18 21:09:14 +00:00
LocalAI [bot]
902e47f0b0
chore: ⬆️ Update ggml-org/llama.cpp to 0320ac5264279d74f8ee91bafa6c90e9ab9bbb91 ( #6306 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
v3.5.2
2025-09-18 09:27:18 +02:00
Ettore Di Giacinto
50bb78fd24
Add permissions for issues and actions
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-18 09:26:10 +02:00
LocalAI [bot]
542f07ab2d
docs: ⬆️ update docs version mudler/LocalAI ( #6305 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-17 21:06:50 +00:00
Ettore Di Giacinto
77c5acb9db
Revert "feat(nvidia-gpu): bump images to cuda 12.8" ( #6303 )
...
Revert "feat(nvidia-gpu): bump images to cuda 12.8 (#6239)"
This reverts commit d9e25af7b5.
2025-09-17 19:31:43 +02:00
Ettore Di Giacinto
44bbf4d778
chore(model gallery): add websailor-7b ( #6300 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.5.1
2025-09-17 09:49:58 +02:00
Ettore Di Giacinto
633c12f93d
chore(model gallery): add websailor-32b ( #6299 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:48:16 +02:00
Ettore Di Giacinto
6f24135f1d
chore(model gallery): add webwatcher-32b ( #6298 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:42:54 +02:00
Ettore Di Giacinto
b72aa7b4fa
chore(model gallery): add webwatcher-7b ( #6297 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:36:25 +02:00
Ettore Di Giacinto
e94e725479
chore(model gallery): add alibaba-nlp_tongyi-deepresearch-30b-a3b ( #6295 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-17 09:22:19 +02:00
LocalAI [bot]
e4ac7b14a3
chore: ⬆️ Update ggml-org/llama.cpp to 8ff206097c2bf3ca1c7aa95f9d6db779fc7bdd68 ( #6292 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-16 21:09:47 +00:00
Ettore Di Giacinto
ddb39c73f2
chore(model gallery): add holo1.5-3b ( #6291 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:13:11 +02:00
Ettore Di Giacinto
264b09fb1e
chore(model gallery): add holo1.5-7b ( #6290 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:10:27 +02:00
Ettore Di Giacinto
36dd45df51
chore(model gallery): add holo1.5-72b ( #6289 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:07:50 +02:00
Ettore Di Giacinto
e5599f87b8
chore(model gallery): add k2-think-i1 ( #6288 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-16 18:05:01 +02:00
LocalAI [bot]
e89b5cc0e3
chore: ⬆️ Update ggml-org/llama.cpp to b907255f4bd169b0dc7dca9553b4c54af5170865 ( #6287 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-16 08:10:37 +02:00
Richard Palethorpe
10bf1084cc
chore: ⬆️ Update leejet/stable-diffusion.cpp to 0ebe6fe118f125665939b27c89f34ed38716bff8 ( #6271 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Move parameters and start refactor of passing params
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): Add default sampler option
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-15 21:42:46 +02:00
Ettore Di Giacinto
b08ae559b3
chore(model gallery): add qwen3-stargate-sg1-uncensored-abliterated-8b-i1 ( #6270 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 11:03:26 +02:00
Ettore Di Giacinto
aa7cb7e18c
chore(model gallery): add aquif-ai_aquif-3.5-8b-think ( #6269 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 10:42:42 +02:00
Ettore Di Giacinto
eadd3d4e46
chore(model gallery): add baidu_ernie-4.5-21b-a3b-thinking ( #6267 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-15 10:27:02 +02:00
LocalAI [bot]
2a18206033
chore: ⬆️ Update ggml-org/llama.cpp to 6c019cb04e86e2dacfe62ce7666c64e9717dde1f ( #6265 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-14 21:19:41 +00:00
LocalAI [bot]
39798d734e
chore: ⬆️ Update ggml-org/llama.cpp to 0fa154e3502e940df914f03b41475a2b80b985b0 ( #6263 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-14 19:59:58 +00:00
Gianluca Boiano
d0e99562af
chore(aio): upgrade minicpm-v model to latest 4.5 ( #6262 )
...
chore(aio): upgrade vision model to MiniCPM-V 4.5
Signed-off-by: Gianluca Boiano <morf3089@gmail.com >
2025-09-14 15:04:58 +02:00
Ettore Di Giacinto
6410c99bf2
fix(llama-cpp): correctly calculate embeddings ( #6259 )
...
* chore(tests): check embeddings differs in llama.cpp
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(llama.cpp): use the correct field for embedding
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(llama.cpp): use embedding type none
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(tests): add test-cases in aio-e2e suite
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
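The first bullet above adds a test that embeddings differ. A minimal sketch of what such a check could look like follows. It assumes embeddings arrive as plain lists of floats; the helper names and the 0.999 threshold are hypothetical and are not taken from the LocalAI test suite.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        raise ValueError("zero vector has no direction")
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (norm_a * norm_b)


def embeddings_differ(a, b, threshold=0.999):
    """Hypothetical check: two inputs should not map to (nearly) the same vector.

    The threshold is an illustrative assumption, not a value from the project.
    """
    return cosine_similarity(a, b) < threshold
```

A backend that returned the same vector for every prompt would fail this check for any pair of distinct inputs.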
2025-09-13 23:11:54 +02:00
LocalAI [bot]
55766d269b
chore: ⬆️ Update ggml-org/llama.cpp to aa0c461efe3603639af1a1defed2438d9c16ca0f ( #6261 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-13 21:11:18 +00:00
Ettore Di Giacinto
ffa0ad1eac
Fix formatting issues in README.md links
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-09-13 09:16:17 +02:00
LocalAI [bot]
623789a29e
chore: ⬆️ Update ggml-org/llama.cpp to 40be51152d4dc2d47444a4ed378285139859895b ( #6260 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-12 21:10:39 +00:00
Richard Palethorpe
2b9a3d32c9
chore: ⬆️ Update leejet/stable-diffusion.cpp to fce6afcc6a3250a8e17923608922d2a99b339b47 ( #6256 )
...
* ⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
* fix(stablediffusion-ggml): Add SMOOTHSTEP scheduler and assert sampler and scheduler counts
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Signed-off-by: Richard Palethorpe <io@richiejp.com >
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-12 12:28:20 +02:00
LocalAI [bot]
f8b71dc5d0
chore: ⬆️ Update ggml-org/llama.cpp to 0e6ff0046f4a2983b2c77950aa75960fe4b4f0e2 ( #6235 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-11 21:21:49 +00:00
KingJ
1d3331b5cb
fix(rocm): Rename tag suffix for hipblas whisper build to match backend config ( #6247 )
...
Rename tag suffix for hipblas whisper to match backend config
hipblas images generally have the suffix `-gpu-rocm-hipblas-X`. One exception to this currently is the hipblas build of Whisper, which has the suffix `gpu-hipblas-whisper`.
However, as `backend/index.yaml` references the image tag for Whisper using the more consistent form (i.e. `latest-gpu-rocm-hipblas-whisper`), it is not possible to add the backend, as raised in #6114.
Therefore, rename the suffix for hipblas whisper images to use the more consistent form, aligning with other hipblas builds as well as the expected image name in `backend/index.yaml`.
Signed-off-by: Kingsley Jarrett <kj@kingj.net >
2025-09-11 21:19:09 +02:00
Mário Freitas
2c0b9c6349
fix(chat): use proper finish_reason for tool/function calling ( #6243 )
...
Signed-off-by: Mário Freitas <imkira@gmail.com >
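The commit body does not describe the change, so the following is only a sketch of the OpenAI chat-completions convention that a "proper" `finish_reason` presumably refers to: `tool_calls` when the assistant message carries tool calls, `function_call` for the legacy function-calling field, `length` when generation hit the token limit, and `stop` otherwise. The function name and the dict-shaped message are hypothetical and do not reflect LocalAI's Go implementation.

```python
def pick_finish_reason(message, hit_token_limit=False):
    """Choose a finish_reason following the OpenAI chat-completions convention.

    `message` is assumed to be a dict shaped like an assistant message.
    """
    if message.get("tool_calls"):
        return "tool_calls"
    if message.get("function_call"):
        return "function_call"
    if hit_token_limit:
        return "length"
    return "stop"
```

Clients commonly branch on this field to decide whether to execute a tool and send the result back, so reporting `stop` on a tool call can make them treat the turn as finished.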
2025-09-11 21:13:23 +02:00
qxo
3c6c976755
feat: support HF_ENDPOINT env for the HuggingFace endpoint ( #6220 )
...
e.g. `HF_ENDPOINT=https://hf-mirror.com`
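As a rough illustration of how an endpoint override like this typically works, the sketch below falls back to `https://huggingface.co` when `HF_ENDPOINT` is unset and builds a download URL using the Hub's `/<repo>/resolve/<branch>/<file>` path layout. The function names are hypothetical, and whether LocalAI builds its URLs exactly this way is an assumption.

```python
import os

DEFAULT_HF_ENDPOINT = "https://huggingface.co"


def hf_endpoint(env=None):
    """Return the Hugging Face base URL, honouring HF_ENDPOINT when set."""
    env = os.environ if env is None else env
    value = env.get("HF_ENDPOINT", "").strip()
    # Drop a trailing slash so joined paths never contain "//".
    return (value or DEFAULT_HF_ENDPOINT).rstrip("/")


def model_file_url(repo, filename, branch="main", env=None):
    """Build a file download URL (hypothetical helper, for illustration only)."""
    return f"{hf_endpoint(env)}/{repo}/resolve/{branch}/{filename}"
```

With `HF_ENDPOINT=https://hf-mirror.com` exported, `model_file_url("org/model", "model.gguf")` would point at the mirror instead of the default host.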
2025-09-11 21:04:57 +02:00
Sertaç Özercan
ebbcba342a
fix: runtime capability detection for backends ( #6149 )
...
* runtime capability detection for backends
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* test
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* skip nvidia on darwin
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* address review comments
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* fix apple test
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
* remove unused func
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
---------
Signed-off-by: Sertac Ozercan <sozercan@gmail.com >
2025-09-11 10:46:19 +02:00
LocalAI [bot]
0de75519dc
chore: ⬆️ Update leejet/stable-diffusion.cpp to b0179181069254389ccad604e44f17a2c25b4094 ( #6246 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-09-10 23:43:12 +02:00
Richard Palethorpe
37f5e4f5c1
feat(whisper): Add diarization (tinydiarize) ( #6184 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-09-10 19:09:28 +02:00
Ettore Di Giacinto
ffa934b959
feat(chatterbox): add MPS, and CPU, pin version ( #6242 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 17:58:07 +02:00
Mauro Morales
59311d8b1e
Point to LocalAI-examples repo for llava ( #6241 )
...
Signed-off-by: Mauro Morales <contact@mauromorales.com >
2025-09-09 16:40:55 +02:00
Ettore Di Giacinto
d9e25af7b5
feat(nvidia-gpu): bump images to cuda 12.8 ( #6239 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-09-09 13:02:17 +02:00
dependabot[bot]
e4f8b63b40
chore(deps): bump actions/labeler from 5 to 6 ( #6229 )
...
Bumps [actions/labeler](https://github.com/actions/labeler) from 5 to 6.
- [Release notes](https://github.com/actions/labeler/releases)
- [Commits](https://github.com/actions/labeler/compare/v5...v6)
---
updated-dependencies:
- dependency-name: actions/labeler
dependency-version: '6'
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-09 08:57:13 +02:00
dependabot[bot]
1364ae9be6
chore(deps): bump github.com/swaggo/swag from 1.16.3 to 1.16.6 ( #6222 )
...
Bumps [github.com/swaggo/swag](https://github.com/swaggo/swag) from 1.16.3 to 1.16.6.
- [Release notes](https://github.com/swaggo/swag/releases)
- [Changelog](https://github.com/swaggo/swag/blob/master/.goreleaser.yml)
- [Commits](https://github.com/swaggo/swag/compare/v1.16.3...v1.16.6)
---
updated-dependencies:
- dependency-name: github.com/swaggo/swag
dependency-version: 1.16.6
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-09 08:56:59 +02:00
dependabot[bot]
cfd6a9150d
chore(deps): bump oras.land/oras-go/v2 from 2.5.0 to 2.6.0 ( #6225 )
...
Bumps [oras.land/oras-go/v2](https://github.com/oras-project/oras-go) from 2.5.0 to 2.6.0.
- [Release notes](https://github.com/oras-project/oras-go/releases)
- [Commits](https://github.com/oras-project/oras-go/compare/v2.5.0...v2.6.0)
---
updated-dependencies:
- dependency-name: oras.land/oras-go/v2
dependency-version: 2.6.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-09-08 23:43:28 +00:00