Ettore Di Giacinto
6ef3852de5
chore(docs): fixup tag
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-26 21:25:07 +02:00
Ettore Di Giacinto
a8057b952c
fix(cuda): be consistent with image tag naming ( #5916 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.2.3
2025-07-26 08:30:59 +02:00
Ettore Di Giacinto
fd5c1d916f
chore(docs): add documentation on backend detection override ( #5915 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-26 08:18:31 +02:00
LocalAI [bot]
5ce982b9c9
chore: ⬆️ Update ggml-org/llama.cpp to c7f3169cd523140a288095f2d79befb20a0b73f4 ( #5913 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 23:08:20 +02:00
Ettore Di Giacinto
47ccfccf7a
fix(ci): add nvidia-l4t capability to l4t images ( #5914 )
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
v3.2.2
2025-07-25 22:45:09 +02:00
LocalAI [bot]
a760f7ff39
docs: ⬆️ update docs version mudler/LocalAI ( #5912 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 22:15:16 +02:00
Ettore Di Giacinto
facf7625f3
fix(vulkan): use correct image suffix ( #5911 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:20 +02:00
Ettore Di Giacinto
b3600b3c50
feat(backend gallery): add mirrors ( #5910 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 19:20:08 +02:00
Ettore Di Giacinto
f0b47cfe6a
fix(backends gallery): trim string when reading cap from file ( #5909 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 18:10:02 +02:00
Ettore Di Giacinto
ee625fc34e
fix(backends gallery): pass-by backend galleries to the model service ( #5906 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
v3.2.1
2025-07-25 16:38:09 +02:00
Ettore Di Giacinto
693aa0b5de
Update README.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-25 11:51:23 +02:00
Ettore Di Giacinto
3973e6e5da
fix(install.sh): update to use the new binary naming ( #5903 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-25 10:43:22 +02:00
LocalAI [bot]
fb6ec68090
chore: ⬆️ Update ggml-org/whisper.cpp to 7de8dd783f7b2eab56bff6bbc5d3369e34f0e77f ( #5902 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:24 +02:00
LocalAI [bot]
0301fc7c46
chore: ⬆️ Update leejet/stable-diffusion.cpp to eed97a5e1d054f9c1e7ac01982ae480411d4157e ( #5901 )
...
⬆️ Update leejet/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:40:06 +02:00
LocalAI [bot]
813cb4296d
chore: ⬆️ Update ggml-org/llama.cpp to 3f4fc97f1d745f1d5d3c853949503136d419e6de ( #5900 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-25 08:39:44 +02:00
Ettore Di Giacinto
deda3a4972
Update build documentation
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 22:53:08 +02:00
Ettore Di Giacinto
a28f27604a
Update backends.md
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
v3.2.0
2025-07-24 16:18:25 +02:00
Richard Palethorpe
8fe9fa98f2
fix(stablediffusion-cpp): Switch back to upstream and update ( #5880 )
...
* sync(stablediffusion-cpp): Switch back to upstream and update
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(stablediffusion-ggml): NULL terminate options array to prevent segfault
Signed-off-by: Richard Palethorpe <io@richiejp.com >
* fix(build): Add BUILD_TYPE and BASE_IMAGE to all backends
Signed-off-by: Richard Palethorpe <io@richiejp.com >
---------
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-24 16:03:18 +02:00
Nathaniel Hyson
4db1b80278
Update quickstart.md ( #5898 )
...
Fixed spelling mistake
Signed-off-by: Nathaniel Hyson <Shinrai@users.noreply.github.com >
2025-07-24 15:04:02 +02:00
Dave
b3c2a3c257
fix: untangle pkg and core ( #5896 )
...
* migrate core/system to pkg/system - it has no dependencies FROM core, and IS USED in pkg
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/templates up to core/templates -- nothing in pkg references it, but it does reference core.
Signed-off-by: Dave Lee <dave@gray101.com >
* remove extra check, len of nil is 0
Signed-off-by: Dave Lee <dave@gray101.com >
* move pkg/startup to core/startup -- it does have important and unfixable dependencies on core
Signed-off-by: Dave Lee <dave@gray101.com >
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-24 15:03:41 +02:00
LocalAI [bot]
61c2304638
chore: ⬆️ Update ggml-org/llama.cpp to a86f52b2859dae4db5a7a0bbc0f1ad9de6b43ec6 ( #5894 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-24 15:02:37 +02:00
Ettore Di Giacinto
92c5ab97e2
chore(Makefile): drop unused targets ( #5893 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-24 14:49:50 +02:00
LocalAI [bot]
76e471441c
chore: ⬆️ Update richiejp/stable-diffusion.cpp to 10c6501bd05a697e014f1bee3a84e5664290c489 ( #5732 )
...
⬆️ Update richiejp/stable-diffusion.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-23 21:09:02 +00:00
Dave
9cecf5e7ac
fix: rename Dockerfile.go --> Dockerfile.golang to avoid IDE errors ( #5892 )
...
extract up and out Dockerfile.go --> Dockerfile.golang rename. Prevents syntax highlighting and IDE errors
Signed-off-by: Dave Lee <dave@gray101.com >
2025-07-23 21:33:26 +02:00
Ettore Di Giacinto
b7b3164736
chore: try to speedup build
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 21:21:23 +02:00
Ettore Di Giacinto
5f7ece3e94
fix(p2p): adapt to backend changes, general improvements ( #5889 )
...
The binary is now named "llama-cpp-rpc-server" for p2p workers.
We also decrease the default token rotation interval, in this way
peer discovery is much more responsive.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 12:40:32 +02:00
Ettore Di Giacinto
c717b8d800
chore(model gallery): add qwen3-coder-480b-a35b-instruct ( #5888 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:59:58 +02:00
Ettore Di Giacinto
f1d35c4149
chore(model gallery): add qwen3-235b-a22b-instruct-2507 ( #5887 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:54:58 +02:00
Ettore Di Giacinto
ee7e77b6c1
chore(model gallery): add menlo_lucy ( #5886 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:51:51 +02:00
Ettore Di Giacinto
324fecbb75
chore(model gallery): add entfane_math-genius-7b ( #5885 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:45:23 +02:00
Ettore Di Giacinto
a79bfcf0a7
chore(model gallery): add dream-org_dream-v0-instruct-7b ( #5884 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:40:53 +02:00
Ettore Di Giacinto
82495e7fb6
chore(model gallery): add omega-qwen3-atom-8b ( #5883 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 11:33:43 +02:00
Ettore Di Giacinto
6030b12283
chore(backend gallery): add name to 'diffusers' meta
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-23 09:21:04 +02:00
LocalAI [bot]
b5be867e28
chore: ⬆️ Update ggml-org/llama.cpp to acd6cb1c41676f6bbb25c2a76fa5abeb1719301e ( #5882 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 21:12:06 +00:00
Ettore Di Giacinto
9b806250d4
chore: drop vllm for cuda 11 ( #5881 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 18:47:31 +02:00
Ettore Di Giacinto
5f066e702f
fix(darwin): add dashes on image suffix
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 17:08:19 +02:00
dependabot[bot]
47bb3a3db2
chore(deps): bump securego/gosec from 2.22.5 to 2.22.7 ( #5878 )
...
Bumps [securego/gosec](https://github.com/securego/gosec ) from 2.22.5 to 2.22.7.
- [Release notes](https://github.com/securego/gosec/releases )
- [Changelog](https://github.com/securego/gosec/blob/master/.goreleaser.yml )
- [Commits](https://github.com/securego/gosec/compare/v2.22.5...v2.22.7 )
---
updated-dependencies:
- dependency-name: securego/gosec
dependency-version: 2.22.7
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-07-22 16:42:11 +02:00
Richard Palethorpe
51230a801e
fix(build): Add and update ONEAPI_VERSION ( #5874 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:49 +02:00
Richard Palethorpe
754bedc3ea
fix(realtime): Reset speech started flag on commit ( #5879 )
...
Signed-off-by: Richard Palethorpe <io@richiejp.com >
2025-07-22 16:41:12 +02:00
Ettore Di Giacinto
98e5291afc
feat: refactor build process, drop embedded backends ( #5875 )
...
* feat: split remaining backends and drop embedded backends
- Drop silero-vad, huggingface, and stores backend from embedded
binaries
- Refactor Makefile and Dockerfile to avoid building grpc backends
- Drop golang code that was used to embed backends
- Simplify building by using goreleaser
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(gallery): be specific with llama-cpp backend templates
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(docs): update
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore(ci): minor fixes
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* chore: drop all ffmpeg references
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: run protogen-go
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Always enable p2p mode
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update gorelease file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix(stores): do not always load
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix linting issues
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Simplify
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Mac OS fixup
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-22 16:31:04 +02:00
LocalAI [bot]
e29b2c3aff
chore: ⬆️ Update ggml-org/llama.cpp to 6c9ee3b17e19dcc82ab93d52ae46fdd0226d4777 ( #5877 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:43 +02:00
LocalAI [bot]
8dc574f3c4
chore: ⬆️ Update ggml-org/whisper.cpp to 1f5cf0b2888402d57bb17b2029b2caa97e5f3baf ( #5876 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-22 08:25:13 +02:00
Ettore Di Giacinto
05bf2493a5
fix: do not pass by environ to ffmpeg ( #5871 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-21 14:35:33 +02:00
Max Goltzsche
eae4ca08da
feat(openai): support input_audio chat api field ( #5870 )
...
Improving the chat completion endpoint OpenAI API compatibility by supporting messages of type `input_audio`, e.g.:
```
{
...
"messages": [
{
"role": "user",
"content": [{
"type": "input_audio",
"input_audio": {
"data": "<base64-encoded audio data>",
"format": "wav"
}
}]
}
]
}
```
Closes #5869
Signed-off-by: Max Goltzsche <max.goltzsche@gmail.com >
2025-07-21 09:15:55 +02:00
LocalAI [bot]
fa284f7445
chore: ⬆️ Update ggml-org/llama.cpp to 2be60cbc2707359241c2784f9d2e30d8fc7cdabb ( #5867 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-21 09:14:09 +02:00
Ettore Di Giacinto
8f69b80520
Update index.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-07-20 22:54:12 +02:00
Ettore Di Giacinto
b1fc5acd4a
feat: split whisper from main binary ( #5863 )
...
* feat: split whisper from main binary
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanup makefile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add backend builds (missing only darwin)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Test CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper backend to test runs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Make sure we have runtime libs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Less grpc on the main Dockerfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fix hipblas build
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Add whisper to index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Re-enable CI
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Adapt auto-bumper
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 22:52:45 +02:00
LocalAI [bot]
fab41c29dd
chore(model-gallery): ⬆️ update checksum ( #5865 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 20:37:43 +02:00
Ettore Di Giacinto
fb0ec96396
ci: do not upgrade pip
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-07-20 12:30:12 +02:00
LocalAI [bot]
7659461036
chore: ⬆️ Update ggml-org/llama.cpp to a979ca22db0d737af1e548a73291193655c6be99 ( #5862 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-07-20 08:43:36 +02:00