mirror of
https://github.com/mudler/LocalAI.git
synced 2025-12-21 09:20:14 -06:00
Remove model size guidance from FAQ
Removed redundant information about model sizes in the WebUI.

Signed-off-by: Ettore Di Giacinto <mudler@users.noreply.github.com>
committed by GitHub
parent 1e9b115251
commit e963e16bc5
@@ -44,8 +44,6 @@ Model sizes vary significantly depending on the model and quantization level:
- Use SSD storage for better performance
- Consider the model size relative to your system RAM - models larger than your RAM may not run efficiently

The WebUI shows model sizes in the Models tab to help you choose appropriate models for your system.

### Benchmarking LocalAI and llama.cpp shows different results!

LocalAI applies a set of defaults when loading models with the llama.cpp backend, one of these is mirostat sampling - while it achieves better results, it slows down the inference. You can disable this by setting `mirostat: 0` in the model config file. See also the advanced section ({{%relref "docs/advanced/advanced-usage" %}}) for more information and [this issue](https://github.com/mudler/LocalAI/issues/2780).
@@ -89,4 +87,4 @@ This typically happens when your prompt exceeds the context size. Try to reduce

### I'm getting a 'SIGILL' error, what's wrong?

Your CPU probably does not have support for certain instructions that are compiled by default in the pre-built binaries. If you are running in a container, try setting `REBUILD=true` and disable the CPU instructions that are not compatible with your CPU. For instance: `CMAKE_ARGS="-DGGML_F16C=OFF -DGGML_AVX512=OFF -DGGML_AVX2=OFF -DGGML_FMA=OFF" make build`
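
To decide which of these options to turn off, it can help to compare them against the instruction sets your CPU actually reports. The following is a minimal sketch, not part of LocalAI: `ggml_cmake_args` is a hypothetical helper, and the mapping from Linux `/proc/cpuinfo` flag names (`f16c`, `avx512f`, `avx2`, `fma`) to the four CMake options named above is an assumption. It only covers those four options, so a CPU lacking other instructions may still need further flags.

```shell
#!/bin/sh
# Hypothetical helper (not part of LocalAI): print -D<OPTION>=OFF for each of the
# four GGML options mentioned in the FAQ whose CPU flag is missing.
# $1: space-separated CPU flags, e.g. the "flags" line of /proc/cpuinfo on Linux.
ggml_cmake_args() {
    cpu_flags=" $1 "   # pad with spaces so only whole flag names match
    args=""
    # Assumed mapping: cpuinfo flag name -> GGML CMake option.
    for pair in f16c:GGML_F16C avx512f:GGML_AVX512 avx2:GGML_AVX2 fma:GGML_FMA; do
        flag=${pair%%:*}
        opt=${pair##*:}
        case "$cpu_flags" in
            *" $flag "*) ;;                      # supported: keep the default
            *) args="$args -D$opt=OFF" ;;        # missing: disable at build time
        esac
    done
    printf '%s\n' "${args# }"                    # drop the leading space
}

# Possible usage on Linux:
#   CMAKE_ARGS="$(ggml_cmake_args "$(grep -m1 '^flags' /proc/cpuinfo | cut -d: -f2)")" make build
```

If the function prints nothing, the CPU reports all four instruction sets and the cause of the `SIGILL` is likely elsewhere.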