cryptk
28a421cb1d
feat: migrate python backends from conda to uv ( #2215 )
...
* feat: migrate diffusers backend from conda to uv
- replace conda with UV for diffusers install (prototype for all
extras backends)
- add ability to build docker with one/some/all extras backends
instead of all or nothing
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate autogtpq bark coqui from conda to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: convert exllama over to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate exllama2 to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate mamba to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate parler to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate petals to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: fix tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate rerankers to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate sentencetransformers to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: install uv for tests-linux
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: make sure file exists before installing on intel images
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate transformers backend to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate transformers-musicgen to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate vall-e-x to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: migrate vllm to uv
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add uv install to the rest of test-extra.yml
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: adjust file perms on all install/run/test scripts
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add missing acclerate dependencies
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add some more missing dependencies to python backends
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: parler tests venv py dir fix
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: correct filename for transformers-musicgen tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: adjust the pwd for valle tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: cleanup and optimization work for uv migration
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add setuptools to requirements-install for mamba
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: more size optimization work
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: make installs and tests more consistent, cleanup some deps
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: cleanup
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: mamba backend is cublas only
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: uncomment lines in makefile
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-05-10 15:08:08 +02:00
Ettore Di Giacinto
650ae620c5
ci: get latest git version
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 11:33:16 +02:00
Ettore Di Giacinto
6a209cbef6
ci: get file name correctly in checksum_checker.sh
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 10:57:23 +02:00
Ettore Di Giacinto
9786bb826d
ci: try to fix checksum_checker.sh
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 09:34:07 +02:00
Ettore Di Giacinto
9b4c6f348a
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:57:22 +02:00
Ettore Di Giacinto
cb6ddb21ec
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:55:48 +02:00
Ettore Di Giacinto
0baacca605
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:54:35 +02:00
Ettore Di Giacinto
222d714ec7
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:51:57 +02:00
Ettore Di Giacinto
fd2d89d37b
Update checksum_checker.sh
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:43:16 +02:00
Ettore Di Giacinto
6440b608dc
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:42:48 +02:00
Ettore Di Giacinto
1937118eab
Update checksum_checker.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-05-09 00:34:56 +02:00
Ettore Di Giacinto
bc272d1e4b
ci: add checksum checker pipeline ( #2274 )
...
Signed-off-by: mudler <mudler@localai.io >
2024-05-09 00:31:27 +02:00
Ettore Di Giacinto
c5798500cb
feat(single-build): generate single binaries for releases ( #2246 )
...
* feat(single-build): generate single binaries for releases
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* drop old targets
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-05-05 17:20:51 +02:00
cryptk
a0aa5d01a1
feat: update ROCM and use smaller image ( #2196 )
...
* feat: update ROCM and use smaller image
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add call to ldconfig to fix AMDs broken library packages
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-05-03 18:46:49 +02:00
cryptk
f7aabf1b50
fix: bring everything onto the same GRPC version to fix tests ( #2199 )
...
fix: more places where we are installing grpc that need a version specified
fix: attempt to fix metal tests
fix: metal/brew is forcing an update, they don't have 1.58 available anymore
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-30 19:12:15 +00:00
dependabot[bot]
53c3842bc2
build(deps): bump dependabot/fetch-metadata from 2.0.0 to 2.1.0 ( #2186 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 2.0.0 to 2.1.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v2.0.0...v2.1.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-type: direct:production
update-type: version-update:semver-minor
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-29 21:12:37 +00:00
Dave
982dc6a2bd
fix: github bump_docs.sh regex to drop emoji and other text ( #2180 )
...
fix: bump_docs regex
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-29 03:55:29 +00:00
cryptk
987b7ad42d
feat: only keep the build artifacts from the grpc build ( #2172 )
...
* feat: only keep the build artifacts from the grpc build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove separate Cache GRPC build step
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: remove docker inspect step, it is leftover from previous debugging
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-28 19:24:16 +00:00
Ettore Di Giacinto
7e6bf6e7a1
ci: add auto-label rule for gallery in labeler.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-27 19:52:26 +02:00
cryptk
9fc0135991
feat: cleanup Dockerfile and make final image a little smaller ( #2146 )
...
* feat: cleanup Dockerfile and make final image a little smaller
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add build-essential to final stage
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: correct for another cause of GRPC cache misses
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: generate new GRPC cache automatically if needed
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use new GRPC_MAKEFLAGS build arg in GRPC cache generation
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-27 19:48:20 +02:00
fakezeta
c9451cb604
Bump oneapi-basekit, optimum and openvino ( #2139 )
...
* Bump oneapi-basekit, optimum and openvino
* Changed PERFORMANCE HINT to CUMULATIVE_THROUGHPUT
Minor latency change for first token but about 10-15% speedup on token generation.
2024-04-26 16:20:43 +02:00
Ettore Di Giacinto
5d170e9264
Update yaml-check.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-25 16:05:02 +02:00
Ettore Di Giacinto
1b0a64aa46
Update yaml-check.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-25 15:57:06 +02:00
Ettore Di Giacinto
aa8e1c63d5
Create yaml-check.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-25 15:52:52 +02:00
Ettore Di Giacinto
60690c9fc4
ci: add swagger pipeline
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-25 15:11:01 +02:00
Ettore Di Giacinto
b664edde29
feat(rerankers): Add new backend, support jina rerankers API ( #2121 )
...
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-25 00:19:02 +02:00
Dave
228bc4903f
fix: action-tmate detached ( #2092 )
...
connect-timeout-seconds works best with `detached: true`
Signed-off-by: Dave <dave@gray101.com >
2024-04-21 22:39:17 +02:00
Dave
1038f7469c
fix: action-tmate: use connect-timeout-sections and limit-access-to-actor ( #2083 )
...
fix for action-tmate: connect-timeout-sections and limit-access-to-actor
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-20 08:42:02 +00:00
cryptk
852316c5a6
fix: move the GRPC cache generation workflow into it's own concurrency group ( #2071 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-18 20:52:34 -04:00
cryptk
13012cfa70
feat: better control of GRPC docker cache ( #2070 )
...
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-18 16:19:36 -04:00
Ettore Di Giacinto
af9e5a2d05
Revert #1963 ( #2056 )
...
* Revert "fix(fncall): fix regression introduced in #1963 (#2048 )"
This reverts commit 6b06d4e0af .
* Revert "fix: action-tmate back to upstream, dead code removal (#2038 )"
This reverts commit fdec8a9d00 .
* Revert "feat(grpc): return consumed token count and update response accordingly (#2035 )"
This reverts commit e843d7df0e .
* Revert "refactor: backend/service split, channel-based llm flow (#1963 )"
This reverts commit eed5706994 .
* feat(grpc): return consumed token count and update response accordingly
Fixes : #1920
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-17 23:33:49 +02:00
Dave
fdec8a9d00
fix: action-tmate back to upstream, dead code removal ( #2038 )
...
cleanup: upstream action-tmate has taken my PR, drop master reference. Also remove dead code from api.go
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-16 01:46:36 +00:00
dependabot[bot]
320d8a48d9
build(deps): bump github/codeql-action from 2 to 3 ( #2041 )
...
Bumps [github/codeql-action](https://github.com/github/codeql-action ) from 2 to 3.
- [Release notes](https://github.com/github/codeql-action/releases )
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md )
- [Commits](https://github.com/github/codeql-action/compare/v2...v3 )
---
updated-dependencies:
- dependency-name: github/codeql-action
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-15 22:02:44 +00:00
dependabot[bot]
46609e936e
build(deps): bump dependabot/fetch-metadata from 1.3.4 to 2.0.0 ( #2040 )
...
Bumps [dependabot/fetch-metadata](https://github.com/dependabot/fetch-metadata ) from 1.3.4 to 2.0.0.
- [Release notes](https://github.com/dependabot/fetch-metadata/releases )
- [Commits](https://github.com/dependabot/fetch-metadata/compare/v1.3.4...v2.0.0 )
---
updated-dependencies:
- dependency-name: dependabot/fetch-metadata
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-15 21:37:06 +00:00
dependabot[bot]
b72c6cc9fc
build(deps): bump softprops/action-gh-release from 1 to 2 ( #2039 )
...
Bumps [softprops/action-gh-release](https://github.com/softprops/action-gh-release ) from 1 to 2.
- [Release notes](https://github.com/softprops/action-gh-release/releases )
- [Changelog](https://github.com/softprops/action-gh-release/blob/master/CHANGELOG.md )
- [Commits](https://github.com/softprops/action-gh-release/compare/v1...v2 )
---
updated-dependencies:
- dependency-name: softprops/action-gh-release
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-15 20:52:39 +00:00
Dave
d5699dbf4f
fix - correct checkout versions ( #2029 )
...
minor fix - bump some checkout@v3 to checkout@v4 to match and clean up warnings
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-13 19:01:17 +02:00
Ettore Di Giacinto
0fdff26924
feat(parler-tts): Add new backend ( #2027 )
...
* feat(parler-tts): Add new backend
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(parler-tts): try downgrade protobuf
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* feat(parler-tts): add parler conda env
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* Revert "feat(parler-tts): try downgrade protobuf"
This reverts commit bd5941d5cfc00676b45a99f71debf3c34249cf3c.
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* deps: add grpc
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* fix: try to gen proto with same environment
* workaround
* Revert "fix: try to gen proto with same environment"
This reverts commit 998c745e2f .
* Workaround fixup
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
Co-authored-by: Dave <dave@gray101.com >
2024-04-13 18:59:21 +02:00
Ettore Di Giacinto
b91820b7f8
Update localaibot_automerge.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-13 13:46:07 +02:00
Ettore Di Giacinto
4e74560649
ci: fix release pipeline missing dependencies ( #2025 )
2024-04-13 13:30:40 +02:00
Ettore Di Giacinto
95244ed6e7
Update localaibot_automerge.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-13 10:03:15 +02:00
Ettore Di Giacinto
f1f39eea3f
Create localaibot_automerge.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-13 09:47:33 +02:00
Dave
eed5706994
refactor: backend/service split, channel-based llm flow ( #1963 )
...
Refactor: channel based llm flow and services split
---------
Signed-off-by: Dave Lee <dave@gray101.com >
2024-04-13 09:45:34 +02:00
cryptk
1981154f49
fix: dont commit generated files to git ( #1993 )
...
* fix: initial work towards not committing generated files to the repository
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* feat: improve build docs
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: remove unused folder from .dockerignore and .gitignore
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: attempt to fix extra backend tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: attempt to fix other tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more test fixes
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: fix apple tests
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more extras tests fixes
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add GOBIN to PATH in docker build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: extra tests and Dockerfile corrections
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: remove build dependency checks
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add golang protobuf compilers to tests-linux action
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: ensure protogen is run for extra backend installs
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: use newer protobuf
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: more missing protoc binaries
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: missing dependencies during docker build
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: don't install grpc compilers in the final stage if they aren't needed
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: python-grpc-tools in 22.04 repos is too old
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: add a couple of extra build dependencies to Makefile
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
* fix: unbreak container rebuild functionality
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
---------
Signed-off-by: Chris Jowett <421501+cryptk@users.noreply.github.com >
2024-04-13 09:37:32 +02:00
Ettore Di Giacinto
0e549424e7
Update dependabot_auto.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-12 15:59:25 +02:00
Ettore Di Giacinto
69d638268b
Update dependabot_auto.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-12 15:57:13 +02:00
Ettore Di Giacinto
18eea9088a
Update dependabot_auto.yml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-12 15:38:34 +02:00
Ettore Di Giacinto
fb105837ba
Update secscan.yaml
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-12 15:37:56 +02:00
Ettore Di Giacinto
099bd54ff2
ci: try to build on macos14 ( #2011 )
...
* ci: try to build on macos14
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
* ci: fixup artifact name
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
---------
Signed-off-by: Ettore Di Giacinto <mudler@localai.io >
2024-04-11 19:22:30 +02:00
Ettore Di Giacinto
b1a242251c
ci: fixup upload artifact name
...
Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com >
2024-04-11 18:26:03 +02:00
dependabot[bot]
b606c7b768
build(deps): bump actions/upload-artifact from 3 to 4 ( #2007 )
...
Bumps [actions/upload-artifact](https://github.com/actions/upload-artifact ) from 3 to 4.
- [Release notes](https://github.com/actions/upload-artifact/releases )
- [Commits](https://github.com/actions/upload-artifact/compare/v3...v4 )
---
updated-dependencies:
- dependency-name: actions/upload-artifact
dependency-type: direct:production
update-type: version-update:semver-major
...
Signed-off-by: dependabot[bot] <support@github.com >
Co-authored-by: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
2024-04-11 14:44:02 +00:00