Prioritize UI-TARS-2 section in computer-use agents docs

This commit is contained in:
ddupont
2025-11-18 17:22:53 -05:00
committed by GitHub
parent 210fc01eea
commit 00231b6d52

View File

@@ -9,6 +9,18 @@ All agent loops are compatible with any LLM provider supported by LiteLLM.
See [Running Models Locally](/agent-sdk/supported-model-providers/local-models) for how to use Hugging Face and MLX models on your own machine.
## UI-TARS-2
Nextgeneration UITARS via Cua Router:
- `cua/bytedance/ui-tars-2`
```python
agent = ComputerAgent("cua/bytedance/ui-tars-2", tools=[computer])
async for _ in agent.run("Open a browser and search for Python tutorials"):
pass
```
## Gemini CUA
Gemini models with computer-use capabilities:
@@ -99,18 +111,6 @@ async for _ in agent.run("Open the settings menu and change the theme to dark mo
pass
```
## UI-TARS-2
Nextgeneration UITARS via Cua Router:
- `cua/bytedance/ui-tars-2`
```python
agent = ComputerAgent("cua/bytedance/ui-tars-2", tools=[computer])
async for _ in agent.run("Open a browser and search for Python tutorials"):
pass
```
---
CUAs also support direct click prediction. See [Grounding Models](./grounding-models) for details on `predict_click()`.