mirror of
https://github.com/trycua/computer.git
synced 2026-01-04 20:40:15 -06:00
Merge pull request #620 from sarinali/feat/cua-vlm-qwen-docs
Qwen Cua VLM Docs
This commit is contained in:
@@ -77,10 +77,10 @@ async for _ in agent.run("Open Firefox and navigate to github.com"):
|
||||
|
||||
Qwen3 VL family:
|
||||
|
||||
- `openrouter/qwen/qwen3-vl-235b-a22b-instruct`
|
||||
- `cua/qwen/qwen3-vl-235b` (via CUA VLM Router - recommended)
|
||||
|
||||
```python
|
||||
agent = ComputerAgent("openrouter/qwen/qwen3-vl-235b-a22b-instruct", tools=[computer])
|
||||
agent = ComputerAgent("cua/qwen/qwen3-vl-235b", tools=[computer])
|
||||
async for _ in agent.run("Open Firefox and navigate to github.com"):
|
||||
pass
|
||||
```
|
||||
|
||||
@@ -55,10 +55,12 @@ async for result in agent.run(messages):
|
||||
|
||||
The CUA VLM Router currently supports these models:
|
||||
|
||||
| Model ID | Provider | Description | Best For |
|
||||
| --------------------------------- | --------- | ----------------- | ---------------------------------- |
|
||||
| `cua/anthropic/claude-sonnet-4.5` | Anthropic | Claude Sonnet 4.5 | General-purpose tasks, recommended |
|
||||
| `cua/anthropic/claude-haiku-4.5` | Anthropic | Claude Haiku 4.5 | Fast responses, cost-effective |
|
||||
| Model ID | Provider | Description | Best For |
|
||||
| --------------------------------- | --------- | ----------------- | --------------------------------------- |
|
||||
| `cua/anthropic/claude-sonnet-4.5` | Anthropic | Claude Sonnet 4.5 | General-purpose tasks, recommended |
|
||||
| `cua/anthropic/claude-opus-4.5` | Anthropic | Claude Opus 4.5 | Enhanced agentic and computer-use tasks |
|
||||
| `cua/anthropic/claude-haiku-4.5` | Anthropic | Claude Haiku 4.5 | Fast responses, cost-effective |
|
||||
| `cua/qwen/qwen3-vl-235b` | Qwen | Qwen3 VL 235B | Large-scale vision-language tasks |
|
||||
|
||||
## How It Works
|
||||
|
||||
@@ -347,6 +349,52 @@ agent = ComputerAgent(
|
||||
)
|
||||
```
|
||||
|
||||
### Using Qwen3 VL 235B
|
||||
|
||||
```python
|
||||
from agent import ComputerAgent
|
||||
from computer import Computer
|
||||
|
||||
computer = Computer(os_type="linux", provider_type="docker")
|
||||
|
||||
agent = ComputerAgent(
|
||||
model="cua/qwen/qwen3-vl-235b",
|
||||
tools=[computer],
|
||||
only_n_most_recent_images=3
|
||||
)
|
||||
|
||||
messages = [{"role": "user", "content": "Open a browser and search for Python tutorials"}]
|
||||
|
||||
async for result in agent.run(messages):
|
||||
print(result)
|
||||
```
|
||||
|
||||
### Using Claude Opus 4.5
|
||||
|
||||
```python
|
||||
from agent import ComputerAgent
|
||||
from computer import Computer
|
||||
|
||||
computer = Computer(
|
||||
os_type="linux",
|
||||
provider_type="cloud",
|
||||
name="your-container-name",
|
||||
api_key="your-cua-api-key"
|
||||
)
|
||||
|
||||
agent = ComputerAgent(
|
||||
model="cua/anthropic/claude-opus-4.5",
|
||||
tools=[computer],
|
||||
instructions="You are a helpful assistant that can control computers",
|
||||
only_n_most_recent_images=3
|
||||
)
|
||||
|
||||
messages = [{"role": "user", "content": "Open a browser and search for Python tutorials"}]
|
||||
|
||||
async for result in agent.run(messages):
|
||||
print(result)
|
||||
```
|
||||
|
||||
## Migration from Direct Provider Access
|
||||
|
||||
Switching from direct provider access (BYOK) to CUA VLM Router is simple:
|
||||
|
||||
@@ -367,7 +367,9 @@ Choose how you want to access vision-language models for your agent:
|
||||
|
||||
**Available CUA models:**
|
||||
- `cua/anthropic/claude-sonnet-4.5` - Claude Sonnet 4.5 (recommended)
|
||||
- `cua/anthropic/claude-haiku-4.5` - Claude Haiku 4.5 (faster)
|
||||
- `cua/anthropic/claude-opus-4.5` - Claude Opus 4.5 (enhanced agentic capabilities)
|
||||
- `cua/anthropic/claude-haiku-4.5` - Claude Haiku 4.5 (faster, cost-effective)
|
||||
- `cua/qwen/qwen3-vl-235b` - Qwen3 VL 235B (large-scale vision-language tasks)
|
||||
|
||||
**Benefits:**
|
||||
- Single API key for multiple providers
|
||||
|
||||
1372
libs/typescript/pnpm-lock.yaml
generated
1372
libs/typescript/pnpm-lock.yaml
generated
File diff suppressed because it is too large
Load Diff
Reference in New Issue
Block a user