mirror of
https://github.com/mudler/LocalAI.git
synced 2026-05-05 17:59:44 -05:00
feat: disable force eviction (#7725)
* feat: allow to set forcing backends eviction while requests are in flight Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * feat: try to make the request sit and retry if eviction couldn't be done Otherwise calls that in order to pass would need to shutdown other backends would just fail. In this way instead we make the request sit and retry eviction until it succeeds. The thresholds can be configured by the user. Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * add tests Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * expose settings to CLI Signed-off-by: Ettore Di Giacinto <mudler@localai.io> * Update docs Signed-off-by: Ettore Di Giacinto <mudler@localai.io> --------- Signed-off-by: Ettore Di Giacinto <mudler@localai.io>
This commit is contained in:
committed by
GitHub
parent
bb459e671f
commit
c844b7ac58
@@ -262,4 +262,13 @@ var _ = Describe("ModelLoader", func() {
|
||||
Expect(modelLoader.GetLoadingCount()).To(Equal(0))
|
||||
})
|
||||
})
|
||||
|
||||
Context("LRU Eviction Retry Settings", func() {
|
||||
It("should allow updating retry settings", func() {
|
||||
modelLoader.SetLRUEvictionRetrySettings(50, 2*time.Second)
|
||||
// Settings are updated - we can verify through behavior if needed
|
||||
// For now, just verify the call doesn't panic
|
||||
Expect(modelLoader).ToNot(BeNil())
|
||||
})
|
||||
})
|
||||
})
|
||||
|
||||
Reference in New Issue
Block a user