Ettore Di Giacinto
2defe98df8
fix(vllm): Update flash-attn to specific wheel URL
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-21 18:06:46 +01:00
Filipe Oliveira
b406b088a7
fix: Update Installer Options URL (#7330)
2025-11-21 17:29:36 +01:00
Ettore Di Giacinto
6261c87b1b
Add NVCC_THREADS and MAX_JOB environment variables
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-21 16:14:13 +01:00
Ettore Di Giacinto
fa00aa0085
chore(ci): add OS check to skip test if not on Linux
...
Skip test on non-Linux operating systems.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-21 15:01:04 +01:00
Ettore Di Giacinto
0e53ce60b4
chore(ci): remove context size configuration from application
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-21 14:57:32 +01:00
Ettore Di Giacinto
8aba078439
chore(tests): add context size option to application initialization
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-21 09:50:05 +01:00
Ettore Di Giacinto
e88db7d142
fix(llama.cpp): handle corner cases with tool content (#7324)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-21 09:21:49 +01:00
LocalAI [bot]
b7b8a0a748
chore: ⬆️ Update ggml-org/llama.cpp to dd0f3219419b24740864b5343958a97e1b3e4b26 (#7322)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-21 08:11:47 +01:00
Ettore Di Giacinto
dd2828241c
chore(docs): add documentation about import (#7315)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-20 23:07:36 +01:00
LocalAI [bot]
b8011f49f2
chore: ⬆️ Update ggml-org/whisper.cpp to 19ceec8eac980403b714d603e5ca31653cd42a3f (#7321)
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-20 23:07:22 +01:00
Copilot
16e5689162
feat(importers): Add diffuser backend importer with ginkgo tests and UI support (#7316)
...
* Initial plan
* Add diffuser backend importer with ginkgo tests
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
* Finalize diffuser backend importer implementation
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
* Add diffuser preferences to model-editor import section
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
* Use gopkg.in/yaml.v3 for consistency in diffuser importer
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-20 22:38:30 +01:00
Ettore Di Giacinto
2dd42292dc
feat(ui): runtime settings (#7320)
...
* feat(ui): add watchdog settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Do not re-read env
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Some refactor, move other settings to runtime (p2p)
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Add API Keys handling
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Allow to disable runtime settings
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Documentation
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Small fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* show MCP toggle in index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Drop context default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-20 22:37:20 +01:00
Ettore Di Giacinto
53d51671d7
Update Docker installation recommendation wording
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-20 17:27:48 +01:00
Ettore Di Giacinto
daf39e1efd
chore(vllm/ci): set maximum number of jobs
...
Also added comments to clarify CPU usage during build.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-20 15:53:32 +01:00
Ettore Di Giacinto
382474e4a1
fix: do not delete files if used by other configured models (#7235)
...
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix: prevent deletion of model files shared by multiple configurations (#7317)
* Initial plan
* fix: do not delete files if used by other configured models
- Fixed bug in DeleteModelFromSystem where OR was used instead of AND for file suffix check
- Fixed bug where model config filename comparison was incorrect
- Added comprehensive Ginkgo test to verify shared model files are not deleted
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
* fix: prevent deletion of model files shared by multiple configurations
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
---------
Co-authored-by: copilot-swe-agent[bot] <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
Co-authored-by: Copilot <198982749+Copilot@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-20 14:55:51 +01:00
Ettore Di Giacinto
5fed9c6596
chore(ci): move intel image builds to self-hosted
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-20 09:36:54 +01:00
LocalAI [bot]
bfa07df7cd
chore: ⬆️ Update ggml-org/llama.cpp to 7d77f07325985c03a91fa371d0a68ef88a91ec7f (#7314)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-20 07:58:42 +01:00
dependabot[bot]
fbaa21b0e5
chore(deps): bump golang.org/x/crypto from 0.43.0 to 0.45.0 in the go_modules group across 1 directory (#7319)
...
chore(deps): bump golang.org/x/crypto
Bumps the go_modules group with 1 update in the / directory: [golang.org/x/crypto](https://github.com/golang/crypto).
Updates `golang.org/x/crypto` from 0.43.0 to 0.45.0
- [Commits](https://github.com/golang/crypto/compare/v0.43.0...v0.45.0)
---
updated-dependencies:
- dependency-name: golang.org/x/crypto
dependency-version: 0.45.0
dependency-type: indirect
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-20 04:19:22 +00:00
Ettore Di Giacinto
95b6c9bb5a
Update docs
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-19 22:25:33 +01:00
Ettore Di Giacinto
2cc4809b0d
feat: docs revamp (#7313)
...
* docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Small enhancements
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Enhancements
* Default to zen-dark
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-19 22:21:20 +01:00
Ettore Di Giacinto
77bbeed57e
feat(importer): unify importing code with CLI (#7299)
...
* feat(importer): support ollama and OCI, unify code
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat: support importing from local file
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* support also yaml config files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Correctly handle local files
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Extract importing errors
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Add importer tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Add integration tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* chore(UX): improve and specify supported URI formats
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fail if backend does not have a runfile
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Adapt tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat(gallery): add cache for galleries
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix(ui): remove handler duplicate
File input handlers are now handled by Alpine.js @change handlers in chat.html.
Removed duplicate listeners to prevent files from being processed twice
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix(ui): be consistent in attachments in the chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Fail if no importer matches
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix: propagate ops correctly
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-19 20:52:11 +01:00
Ettore Di Giacinto
3152611184
chore(deps): bump llama.cpp to '10e9780154365b191fb43ca4830659ef12def80f (#7311)
...
chore(deps): bump llama.cpp to '10e9780154365b191fb43ca4830659ef12def80f'
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-19 14:42:11 +01:00
Ettore Di Giacinto
30f992f241
feat(ui): add backend reinstall button (#7305)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-18 14:52:54 +01:00
ErixM
2709220b84
fix the tts model dropdown to show the currently selected model (#7306)
...
* fix the tts model dropdown to show the currently selected model
* Update core/config/model_config.go
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
---------
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
Co-authored-by: Erixhens Muka <erixhens.muka@bluetensor.ai>
Co-authored-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
2025-11-18 14:49:03 +01:00
LocalAI [bot]
4278506876
chore: ⬆️ Update ggml-org/llama.cpp to cb623de3fc61011e5062522b4d05721a22f2e916 (#7301)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-18 07:43:57 +01:00
LocalAI [bot]
1dd1d12da1
chore: ⬆️ Update ggml-org/whisper.cpp to b12abefa9be2abae39a73fa903322af135024a36 (#7300)
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-18 07:43:33 +01:00
dependabot[bot]
3a5b3bb0a6
chore(deps): bump google.golang.org/protobuf from 1.36.8 to 1.36.10 (#7295)
...
Bumps google.golang.org/protobuf from 1.36.8 to 1.36.10.
---
updated-dependencies:
- dependency-name: google.golang.org/protobuf
dependency-version: 1.36.10
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 22:25:55 +01:00
dependabot[bot]
94d9fc923f
chore(deps): bump github.com/alecthomas/kong from 1.12.1 to 1.13.0 (#7296)
...
Bumps [github.com/alecthomas/kong](https://github.com/alecthomas/kong) from 1.12.1 to 1.13.0.
- [Commits](https://github.com/alecthomas/kong/compare/v1.12.1...v1.13.0)
---
updated-dependencies:
- dependency-name: github.com/alecthomas/kong
dependency-version: 1.13.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:39:14 +01:00
dependabot[bot]
6fcf2c50b6
chore(deps): bump go.yaml.in/yaml/v2 from 2.4.2 to 2.4.3 (#7294)
...
Bumps [go.yaml.in/yaml/v2](https://github.com/yaml/go-yaml) from 2.4.2 to 2.4.3.
- [Commits](https://github.com/yaml/go-yaml/compare/v2.4.2...v2.4.3)
---
updated-dependencies:
- dependency-name: go.yaml.in/yaml/v2
dependency-version: 2.4.3
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:37:19 +01:00
dependabot[bot]
7cbd4a2f18
chore(deps): bump fyne.io/fyne/v2 from 2.7.0 to 2.7.1 (#7293)
...
Bumps [fyne.io/fyne/v2](https://github.com/fyne-io/fyne) from 2.7.0 to 2.7.1.
- [Release notes](https://github.com/fyne-io/fyne/releases)
- [Changelog](https://github.com/fyne-io/fyne/blob/master/CHANGELOG.md)
- [Commits](https://github.com/fyne-io/fyne/compare/v2.7.0...v2.7.1)
---
updated-dependencies:
- dependency-name: fyne.io/fyne/v2
dependency-version: 2.7.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com>
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-17 20:37:07 +01:00
Ettore Di Giacinto
18d11396cd
chore(docs): improve documentation and split into sections bigger topics (#7292)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-17 18:39:21 +01:00
Ettore Di Giacinto
93cd688f40
chore: small ux enhancements (#7290)
...
* chore: improve chat attachments
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* chore: display installed backends/models
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-17 17:09:42 +01:00
Ettore Di Giacinto
721c3f962b
chore: scroll in thinking mode, better buttons placement (#7289)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-17 16:21:14 +01:00
LocalAI [bot]
fb834805db
chore: ⬆️ Update ggml-org/llama.cpp to 80deff3648b93727422461c41c7279ef1dac7452 (#7287)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-17 07:51:08 +01:00
LocalAI [bot]
839aa7b42b
feat(swagger): update swagger (#7286)
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-17 07:49:06 +01:00
Ettore Di Giacinto
e963a45d66
feat(index): minor enhancements (#7288)
...
* feat(ui): add placeholder effect and select first model by default
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* fix(ui): correctly bind focus to parent
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-16 21:23:18 +01:00
Mikhail Khludnev
c313b2c671
fix(reranker): tests and top_n check fix #7212 (#7284)
...
reranker tests and top_n check fix #7212
Signed-off-by: Mikhail Khludnev <mkhl@apache.org>
2025-11-16 17:53:23 +01:00
Ettore Di Giacinto
137f16336e
feat(ui): small refinements (#7285)
...
* feat(ui): show loaded models in the index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* chore(ui): re-organize navbar
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-16 17:50:13 +01:00
Ettore Di Giacinto
d7f9f3ac93
feat: add support to logitbias and logprobs (#7283)
...
* feat: add support to logprobs in results
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* feat: add support to logitbias
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-16 13:27:36 +01:00
Ettore Di Giacinto
cd7d384500
feat: restyle index (#7282)
...
* Move management to separate section
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Make index to redirect to chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Use logo in index
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* work out the wizard in the front-page
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-16 11:01:05 +01:00
LocalAI [bot]
d1a0dd10e6
chore: ⬆️ Update ggml-org/llama.cpp to 662192e1dcd224bc25759aadd0190577524c6a66 (#7277)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-16 08:41:12 +01:00
Ettore Di Giacinto
be8cf838c2
feat(importers): add transformers and vLLM (#7278)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-15 22:47:09 +01:00
LocalAI [bot]
3276d1cdaf
feat(swagger): update swagger (#7276)
...
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-15 21:50:30 +01:00
Ettore Di Giacinto
5e5f01badd
chore(ui): import vendored libs (#7281)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-15 21:49:52 +01:00
Ettore Di Giacinto
6d0f646c37
chore: guide the user to import models (#7280)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-15 21:37:50 +01:00
Ettore Di Giacinto
99d31667f8
chore: do not use placeholder image (#7279)
...
Use font-awesome icons instead
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-15 21:31:58 +01:00
Ettore Di Giacinto
47b546afdc
feat(mcp): add LocalAI endpoint to stream live results of the agent (#7274)
...
* feat(mcp): add LocalAI endpoint to stream live results of the agent
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* wip
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Refactoring
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* MCP UX integration
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Enhance UX
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* Support also non-SSE
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-15 17:38:00 +01:00
LocalAI [bot]
a09d49da43
chore: ⬆️ Update ggml-org/llama.cpp to 9b17d74ab7d31cb7d15ee7eec1616c3d825a84c0 (#7273)
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
2025-11-15 00:05:39 +01:00
Ettore Di Giacinto
1cdcaf0152
feat: migrate to echo and enable cancellation of non-streaming requests (#7270)
...
* WIP: migrate to echo
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
* tests
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-14 22:57:53 +01:00
Ettore Di Giacinto
03e9f4b140
fix: handle tool errors (#7271)
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
2025-11-14 17:23:56 +01:00