chore(model gallery): 🤖 add 1 new models via gallery agent (#7133)

chore(model gallery): 🤖 add new models via gallery agent

Signed-off-by: github-actions[bot] <41898282+github-actions[bot]@users.noreply.github.com>
Co-authored-by: mudler <2420543+mudler@users.noreply.github.com>
LocalAI [bot]
2025-11-06 09:18:59 +01:00
committed by GitHub
parent 41b60fcfd3
commit 2573102317

@@ -23075,3 +23075,77 @@
    - filename: Orca-Agent-v0.1.Q4_K_M.gguf
      sha256: 2943397fe2c23959215218adbfaf361ca7974bbb0f948e08c230e6bccb1f130a
      uri: huggingface://mradermacher/Orca-Agent-v0.1-GGUF/Orca-Agent-v0.1.Q4_K_M.gguf
- !!merge <<: *qwen3
  name: "orca-agent-v0.1-i1"
  urls:
    - https://huggingface.co/mradermacher/Orca-Agent-v0.1-i1-GGUF
  description: |
    **Model Name:** Orca-Agent-v0.1
    **Base Model:** Qwen3-14B
    **Repository:** [Danau5tin/Orca-Agent-v0.1](https://huggingface.co/Danau5tin/Orca-Agent-v0.1)
    **License:** Apache 2.0
    **Use Case:** Multi-Agent Orchestration for Complex Code & System Tasks
    ---
    ### 🔍 **Overview**
    Orca-Agent-v0.1 is a powerful **task orchestration agent** designed to manage complex, multi-step workflows, especially in code and system administration, without directly modifying code. Instead, it acts as a strategic planner that coordinates a team of specialized agents.
    ---
    ### 🛠️ **Key Features**
    - **Intelligent Task Breakdown:** Analyzes user requests and decomposes them into focused subtasks.
    - **Agent Coordination:** Dynamically dispatches:
      - *Explorer agents* to understand the system state.
      - *Coder agents* to implement changes with precise instructions.
      - *Verifier agents* to validate results.
    - **Context Management:** Maintains a persistent context store to track discoveries across steps.
    - **High Performance:** Achieves **18.25% on Terminal Bench** when paired with Qwen3-Coder-30B, nearing the performance of a 480B model.
    ---
    ### 📊 **Performance**
    | Orchestrator | Subagent | Terminal Bench |
    |--------------|----------|----------------|
    | Orca-Agent-v0.1-14B | Qwen3-Coder-30B | **18.25%** |
    | Qwen3-14B | Qwen3-Coder-30B | 7.0% |
    > *Trained on 32x H100s using GRPO + curriculum learning, with full open-source training code available.*
    ---
    ### 📌 **Example Output**
    ```xml
    <task_create>
    agent_type: 'coder'
    title: 'Attempt recovery using the identified backup file'
    description: |
      Move the backup file from /tmp/terraform_work/.terraform.tfstate.tmp to /infrastructure/recovered_state.json.
      Verify file existence, size, and permissions (rw-r--r--).
    max_turns: 10
    context_refs: ['task_003']
    </task_create>
    ```
    ---
    ### 📁 **Serving**
    - ✅ **vLLM:** `vllm serve Danau5tin/Orca-Agent-v0.1`
    - ✅ **SGLang:** `python -m sglang.launch_server --model-path Danau5tin/Orca-Agent-v0.1`
    ---
    ### 🌐 **Learn More**
    - **Training & Code:** [GitHub - Orca-Agent-RL](https://github.com/Danau5tin/Orca-Agent-RL)
    - **Orchestration Framework:** [multi-agent-coding-system](https://github.com/Danau5tin/multi-agent-coding-system)
    ---
    > ✅ *Note: The model at `mradermacher/Orca-Agent-v0.1-i1-GGUF` is a quantized version of this original model. This description reflects the full, unquantized version by the original author.*
  overrides:
    parameters:
      model: Orca-Agent-v0.1.i1-Q4_K_M.gguf
  files:
    - filename: Orca-Agent-v0.1.i1-Q4_K_M.gguf
      sha256: 05548385128da98431f812d1b6bc3f1bff007a56a312dc98d9111b5fb51e1751
      uri: huggingface://mradermacher/Orca-Agent-v0.1-i1-GGUF/Orca-Agent-v0.1.i1-Q4_K_M.gguf
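
Review note: the `files` entry above pins the quantized file to a SHA-256 digest. A minimal sketch of how a reviewer might check a manually downloaded copy against that digest is shown below. The helper names `sha256_of` and `matches_gallery_entry` are hypothetical and are not part of LocalAI; only the digest itself is taken from this commit.

```python
import hashlib

# Digest copied from the `files` entry added in this commit.
EXPECTED_SHA256 = "05548385128da98431f812d1b6bc3f1bff007a56a312dc98d9111b5fb51e1751"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA-256 of a file, reading it in 1 MiB chunks.

    Streaming keeps memory use flat even for multi-gigabyte GGUF files.
    """
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_gallery_entry(path, expected=EXPECTED_SHA256):
    """Compare a local file's digest with the expected one, ignoring hex case."""
    return sha256_of(path) == expected.lower()
```

Calling `matches_gallery_entry("Orca-Agent-v0.1.i1-Q4_K_M.gguf")` on a local copy (the path here is an assumption about where the file was saved) would return `True` only if the file matches the pinned digest.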