Add explanations of agent configurations

2026-01-06 21:39:58 -06:00 · 2025-10-23 14:37:08 -07:00
parent 9a409c3b9f
commit eed006cc25
1 changed files with 9 additions and 5 deletions
--- a/README.md
+++ b/README.md
@@ -126,12 +126,16 @@ Cua uses the OpenAI Agent response format.

 ## Model Configuration

-These are the valid model configurations for a `ComputerAgent`:
+These are the valid model configurations for `ComputerAgent(model="...")`:

-1. `{computer-use-model}`
-2. `{grounding-model}+{any-vlm-with-tools}`
-3. `moondream3+{any-llm-with-tools}`
-4. `human/human` ([Human-in-the-Loop](https://docs.trycua.com/docs/agent-sdk/supported-agents/human-in-the-loop))
+| Configuration                            | Description                                                                                                                                         |
+| ---------------------------------------- | --------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `{computer-use-model}`                   | A single model to perform all computer-use tasks                                                                                                    |
+| `{grounding-model}+{any-vlm-with-tools}` | [Composed](https://docs.trycua.com/docs/agent-sdk/supported-agents/composed-agents) with VLM for captioning and grounding LLM for element detection |
+| `moondream3+{any-llm-with-tools}`        | [Composed](https://docs.trycua.com/docs/agent-sdk/supported-agents/composed-agents) with Moondream3 for captioning and UI element detection         |
+| `human/human`                            | A [human-in-the-loop](https://docs.trycua.com/docs/agent-sdk/supported-agents/human-in-the-loop) in place of a model                                |
+
+### Model Capabilities

 The following table shows which capabilities are supported by each model: