Ettore Di Giacinto
02cc8cbcaa
feat(llama.cpp): consolidate options and respect tokenizer template when enabled ( #7120 )
...
* feat(llama.cpp): expose env vars as options for consistency
This allows to configure everything in the YAML file of the model rather
than have global configurations
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(llama.cpp): respect usetokenizertemplate and use llama.cpp templating system to process messages
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Detect template exists if use tokenizer template is enabled
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Better recognization of chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixes to support tool calls while using templates from tokenizer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Fixups
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Drop template guessing, fix passing tools to tokenizer
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Extract grammar and other options from chat template, add schema struct
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* WIP
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Automatically set use_jinja
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Cleanups, identify by default gguf models for chat
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Update docs
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-07 21:23:50 +01:00
Ettore Di Giacinto
e5e86d0acb
Enhance README with roadmap and getting started info
...
Updated README to include roadmap link and clarify getting started section.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-07 16:50:47 +01:00
Ettore Di Giacinto
edd35d2b33
Fix formatting issues in README.md links
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-07 15:57:08 +01:00
LocalAI [bot]
e8cc29e364
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7162 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-07 08:40:29 +01:00
LocalAI [bot]
8f7c499f17
chore: ⬆️ Update ggml-org/llama.cpp to 7f09a680af6e0ef612de81018e1d19c19b8651e8 ( #7156 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-07 08:38:56 +01:00
dependabot[bot]
ea446fde08
chore(deps): bump github.com/containerd/containerd from 1.7.28 to 1.7.29 in the go_modules group across 1 directory ( #7149 )
...
chore(deps): bump github.com/containerd/containerd
Bumps the go_modules group with 1 update in the / directory: [github.com/containerd/containerd](https://github.com/containerd/containerd ).
Updates `github.com/containerd/containerd` from 1.7.28 to 1.7.29
- [Release notes](https://github.com/containerd/containerd/releases )
- [Changelog](https://github.com/containerd/containerd/blob/main/RELEASES.md )
- [Commits](https://github.com/containerd/containerd/compare/v1.7.28...v1.7.29 )
---
updated-dependencies:
- dependency-name: github.com/containerd/containerd
dependency-version: 1.7.29
dependency-type: direct:production
dependency-group: go_modules
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-06 16:40:49 +00:00
Mikhail Khludnev
122e4c7094
fix(reranker): reproduce ignoring top_n ( #7025 )
...
* fix(reranker): reproduce ignoring top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
* fix(reranker): ignoring top_n
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
---------
Signed-off-by: Mikhail Khludnev <mkhl@apache.org >
2025-11-06 10:03:05 +00:00
LocalAI [bot]
2573102317
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7133 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-06 09:18:59 +01:00
Shohidul Bari
41b60fcfd3
chore: show success toast when system prompt is updated ( #7131 )
...
Signed-off-by: shohidulbari <shohidulbari18@gmail.com >
2025-11-06 09:05:57 +01:00
LocalAI [bot]
cb81869140
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7127 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-05 22:40:37 +01:00
LocalAI [bot]
db9957b94e
chore: ⬆️ Update ggml-org/llama.cpp to a44d77126c911d105f7f800c17da21b2a5b112d1 ( #7125 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-05 21:22:04 +00:00
LocalAI [bot]
98158881c2
chore: ⬆️ Update ggml-org/llama.cpp to ad51c0a720062a04349c779aae301ad65ca4c856 ( #7098 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 21:19:58 +00:00
Ettore Di Giacinto
79247a5d17
Clarify note about DMGs not being signed by Apple
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:09:28 +01:00
Ettore Di Giacinto
46b7a4c5f2
Add macOS DMG download information
...
Added download link and note for macOS DMG installation.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:09:07 +01:00
Ettore Di Giacinto
436e2d91d0
Enhance overview with Docker and installer details
...
Added Docker instructions and clarified one-liner installer for Linux.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:08:03 +01:00
Ettore Di Giacinto
a86fdc4087
Update binaries.md with macOS download instructions
...
Added download instructions for macOS DMG file and updated command for Linux and macOS.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:06:56 +01:00
Ettore Di Giacinto
c7ac6ca687
Update README with DMG signing note
...
Added a note about DMG signing and workarounds.
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-04 12:06:01 +01:00
LocalAI [bot]
7088327e8d
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7077 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 09:27:33 +01:00
LocalAI [bot]
e2cb44ef37
chore: ⬆️ Update ggml-org/llama.cpp to c5023daf607c578d6344c628eb7da18ac3d92d32 ( #7069 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-04 09:26:10 +01:00
dependabot[bot]
3a40b4129c
chore(deps): bump github.com/mudler/edgevpn from 0.31.0 to 0.31.1 ( #7055 )
...
Bumps [github.com/mudler/edgevpn](https://github.com/mudler/edgevpn ) from 0.31.0 to 0.31.1.
- [Release notes](https://github.com/mudler/edgevpn/releases )
- [Changelog](https://github.com/mudler/edgevpn/blob/master/.goreleaser.yml )
- [Commits](https://github.com/mudler/edgevpn/compare/v0.31.0...v0.31.1 )
---
updated-dependencies:
- dependency-name: github.com/mudler/edgevpn
dependency-version: 0.31.1
dependency-type: direct:production
update-type: version-update:semver-patch
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-04 02:29:50 +00:00
dependabot[bot]
4ca8055f21
chore(deps): bump github.com/valyala/fasthttp from 1.55.0 to 1.68.0 ( #7057 )
...
Bumps [github.com/valyala/fasthttp](https://github.com/valyala/fasthttp ) from 1.55.0 to 1.68.0.
- [Release notes](https://github.com/valyala/fasthttp/releases )
- [Commits](https://github.com/valyala/fasthttp/compare/v1.55.0...v1.68.0 )
---
updated-dependencies:
- dependency-name: github.com/valyala/fasthttp
dependency-version: 1.68.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-04 01:16:28 +00:00
dependabot[bot]
704786cc6d
chore(deps): bump github.com/modelcontextprotocol/go-sdk from 1.0.0 to 1.1.0 ( #7053 )
...
chore(deps): bump github.com/modelcontextprotocol/go-sdk
Bumps [github.com/modelcontextprotocol/go-sdk](https://github.com/modelcontextprotocol/go-sdk ) from 1.0.0 to 1.1.0.
- [Release notes](https://github.com/modelcontextprotocol/go-sdk/releases )
- [Commits](https://github.com/modelcontextprotocol/go-sdk/compare/v1.0.0...v1.1.0 )
---
updated-dependencies:
- dependency-name: github.com/modelcontextprotocol/go-sdk
dependency-version: 1.1.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 23:54:13 +00:00
dependabot[bot]
e5ce1fd9cc
chore(deps): bump github.com/onsi/ginkgo/v2 from 2.26.0 to 2.27.2 ( #7056 )
...
Bumps [github.com/onsi/ginkgo/v2](https://github.com/onsi/ginkgo ) from 2.26.0 to 2.27.2.
- [Release notes](https://github.com/onsi/ginkgo/releases )
- [Changelog](https://github.com/onsi/ginkgo/blob/master/CHANGELOG.md )
- [Commits](https://github.com/onsi/ginkgo/compare/v2.26.0...v2.27.2 )
---
updated-dependencies:
- dependency-name: github.com/onsi/ginkgo/v2
dependency-version: 2.27.2
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 21:49:06 +01:00
LocalAI [bot]
ea2037f141
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7068 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 21:48:04 +01:00
dependabot[bot]
567fa62330
chore(deps): bump github.com/mudler/cogito from 0.4.0 to 0.5.0 ( #7054 )
...
Bumps [github.com/mudler/cogito](https://github.com/mudler/cogito ) from 0.4.0 to 0.5.0.
- [Release notes](https://github.com/mudler/cogito/releases )
- [Commits](https://github.com/mudler/cogito/compare/v0.4.0...v0.5.0 )
---
updated-dependencies:
- dependency-name: github.com/mudler/cogito
dependency-version: 0.5.0
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2025-11-03 19:50:22 +00:00
Ettore Di Giacinto
d424a27fa2
chore: display warning only when directory is present ( #7050 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-11-03 18:56:47 +01:00
LocalAI [bot]
3ce9cb566d
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7040 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 09:09:50 +01:00
Ettore Di Giacinto
ee7638a9b0
Update index.yaml with new file entry
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-03 08:47:47 +01:00
LocalAI [bot]
e57e50e441
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7039 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 08:47:16 +01:00
LocalAI [bot]
81880e7975
chore(model-gallery): ⬆️ update checksum ( #7024 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-03 08:31:14 +01:00
LocalAI [bot]
2cad2c8591
chore: ⬆️ Update ggml-org/llama.cpp to cd5e3b57541ecc52421130742f4d89acbcf77cd4 ( #7023 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 21:24:19 +00:00
LocalAI [bot]
b87b41ee45
chore(model gallery): 🤖 add 1 new models via gallery agent ( #7017 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 17:34:11 +01:00
Ettore Di Giacinto
424acd66ad
feat(llama.cpp): allow to set cache-ram and ctx_shift ( #7009 )
...
* feat(llama.cpp): allow to set cache-ram and ctx_shift
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Apply suggestion from @mudler
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-02 17:33:29 +01:00
LocalAI [bot]
3cd8234550
chore: ⬆️ Update ggml-org/whisper.cpp to 999a7e0cbf8484dc2cea1e9f855d6b39f34f7ae9 ( #6997 )
...
⬆️ Update ggml-org/whisper.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 10:39:11 +01:00
LocalAI [bot]
c70a0f05b8
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6989 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 09:48:12 +01:00
LocalAI [bot]
f85e2dd1b8
chore: ⬆️ Update ggml-org/llama.cpp to 2f68ce7cfd20e9e7098514bf730e5389b7bba908 ( #6998 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-02 09:44:37 +01:00
LocalAI [bot]
e485bdf9ab
docs: ⬆️ update docs version mudler/LocalAI ( #6996 )
...
⬆️ Update docs version mudler/LocalAI
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 21:08:08 +00:00
Ettore Di Giacinto
495c4ee694
Update index.yaml with new model entry
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2025-11-01 09:21:22 +01:00
LocalAI [bot]
161d1a0344
chore(model gallery): 🤖 add 1 new models via gallery agent ( #6982 )
...
chore(model gallery): 🤖 add new models via gallery agent
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 09:21:00 +01:00
LocalAI [bot]
b6d1def96f
chore(model-gallery): ⬆️ update checksum ( #6972 )
...
⬆️ Checksum updates in gallery/index.yaml
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
2025-11-01 09:13:42 +01:00
LocalAI [bot]
9ecfdc5938
chore: ⬆️ Update ggml-org/llama.cpp to 31c511a968348281e11d590446bb815048a1e912 ( #6970 )
...
⬆️ Update ggml-org/llama.cpp
Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com >
v3.7.0
2025-10-31 21:04:53 +00:00
Ettore Di Giacinto
c332ef5cce
chore: fix linting issues
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:08:34 +01:00
Ettore Di Giacinto
6e7a8c6041
chore(model gallery): add qwen3-vl-2b-instruct ( #6967 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:04:10 +01:00
Ettore Di Giacinto
43e707ec4f
chore(model gallery): add qwen3-vl-2b-thinking ( #6966 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:03:23 +01:00
Ettore Di Giacinto
fed3663a74
chore(model gallery): add qwen3-vl-4b-thinking ( #6965 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:02:22 +01:00
Ettore Di Giacinto
5b72798db3
chore(model gallery): add qwen3-vl-32b-instruct ( #6964 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 19:01:11 +01:00
Ettore Di Giacinto
d24d6d4e93
chore(model gallery): add qwen3-vl-4b-instruct ( #6963 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:57:50 +01:00
Ettore Di Giacinto
50ee1fbe06
chore(model gallery): add qwen3-vl-30b-a3b-thinking ( #6962 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:53:13 +01:00
Ettore Di Giacinto
19f3425ce0
chore(model gallery): add huihui-qwen3-vl-30b-a3b-instruct-abliterated ( #6961 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:46:25 +01:00
Ettore Di Giacinto
a6ef245534
chore(model gallery): add qwen3-vl-30b-a3b-instruct ( #6960 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2025-10-31 18:37:12 +01:00