From 3ce9cb566d72c5d377b32cc3037b8d6262c18a74 Mon Sep 17 00:00:00 2001 From: "LocalAI [bot]" <139863280+localai-bot@users.noreply.github.com> Date: Mon, 3 Nov 2025 09:09:50 +0100 Subject: [PATCH] chore(model gallery): :robot: add 1 new models via gallery agent (#7040) chore(model gallery): :robot: add new models via gallery agent Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com> Co-authored-by: mudler <2420543+mudler@users.noreply.github.com> --- gallery/index.yaml | 25 +++++++++++++++++++++++++ 1 file changed, 25 insertions(+) diff --git a/gallery/index.yaml b/gallery/index.yaml index d5422daa6..1eb77c8f5 100644 --- a/gallery/index.yaml +++ b/gallery/index.yaml @@ -22981,3 +22981,28 @@ - filename: GroveMoE-Base.i1-Q4_K_M.gguf sha256: 9d7186ba9531bf689c91176468d7a35c0aaac0cd52bd44d4ed8f7654949ef4f4 uri: huggingface://mradermacher/GroveMoE-Base-i1-GGUF/GroveMoE-Base.i1-Q4_K_M.gguf +- !!merge <<: *qwen3 + name: "nvidia.qwen3-nemotron-32b-rlbff" + urls: + - https://huggingface.co/DevQuasar/nvidia.Qwen3-Nemotron-32B-RLBFF-GGUF + description: | + The **nvidia/Qwen3-Nemotron-32B-RLBFF** is a large language model based on the Qwen3 architecture, fine-tuned by NVIDIA using Reinforcement Learning from Human Feedback (RLHF) for improved alignment with human preferences. With 32 billion parameters, it excels in complex reasoning, instruction following, and natural language generation, making it suitable for advanced tasks such as code generation, dialogue systems, and content creation. + + This model is part of NVIDIA’s Nemotron series, designed to deliver high performance and safety in real-world applications. It is optimized for efficient deployment while maintaining strong language understanding and generation capabilities. + + **Key Features:** + - **Base Model**: Qwen3-32B + - **Fine-tuning**: Reinforcement Learning from Human Feedback (RLBFF) + - **Use Case**: Advanced text generation, coding, dialogue, and reasoning + - **License**: MIT (check Hugging Face for full details) + + 👉 [View on Hugging Face](https://huggingface.co/nvidia/Qwen3-Nemotron-32B-RLBFF) + + *Note: The GGUF version hosted by DevQuasar is a quantized variant for efficient local inference. The original, unquantized model is available at the link above.* + overrides: + parameters: + model: nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf + files: + - filename: nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf + sha256: 5dfc9f1dc21885371b12a6e0857d86d6deb62b6601b4d439e4dfe01195a462f1 + uri: huggingface://DevQuasar/nvidia.Qwen3-Nemotron-32B-RLBFF-GGUF/nvidia.Qwen3-Nemotron-32B-RLBFF.Q4_K_M.gguf