From bb721952a1169d193c640700976c223039715c22 Mon Sep 17 00:00:00 2001 From: James Murdza Date: Fri, 24 Oct 2025 11:38:06 -0700 Subject: [PATCH 1/5] Add clarifications to README --- README.md | 26 ++++++++++++++++++-------- 1 file changed, 18 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index bc1f4567..6ae8fd25 100644 --- a/README.md +++ b/README.md @@ -41,13 +41,19 @@ With the Agent SDK, you can: # Agent Usage +Install the agent SDK: + ```bash pip install cua-agent[all] ``` +Initialize a computer agent using a [model configuration string](#model-configuration) and a [computer instance](#computer-usage): + ```python from agent import ComputerAgent +# ComputerAgent works with any computer initialized with the Computer SDK + agent = ComputerAgent( model="anthropic/claude-3-5-sonnet-20241022", tools=[computer], @@ -176,22 +182,26 @@ The following table shows which capabilities are supported by each model: Missing a model? Create a [feature request](https://github.com/trycua/cua/issues/new?assignees=&labels=enhancement&projects=&title=%5BAgent%5D%3A+Add+model+support+for+) or [contribute](https://github.com/trycua/cua/blob/main/CONTRIBUTING.md)! -# Computer +# Computer Usage + +Install the computer SDK: ```bash -pip install cua-computer[all] +pip install cua-computer ``` +Initialize a computer: + ```python from computer import Computer -async with Computer( - os_type="linux", - provider_type="cloud", +computer = Computer( + os_type="linux", # or "macos", "windows" + provider_type="cloud", # or "lume", "docker", "windows_sandbox" name="your-sandbox-name", - api_key="your-api-key" -) as computer: - # Take screenshot + api_key="your-api-key" # only for cloud + # or use_host_computer_server=True for host desktop +) screenshot = await computer.interface.screenshot() # Click and type From 63d0c8122b77c6aa10594da82055f414da8a158e Mon Sep 17 00:00:00 2001 From: James Murdza Date: Fri, 24 Oct 2025 11:48:08 -0700 Subject: [PATCH 2/5] Add a section for each module to the README --- README.md | 90 ++++++++++++++++++++++++++++++++++++++++++++++--------- 1 file changed, 76 insertions(+), 14 deletions(-) diff --git a/README.md b/README.md index 6ae8fd25..e9b5da52 100644 --- a/README.md +++ b/README.md @@ -20,12 +20,12 @@ -With the Computer SDK, you can: +With the [Computer SDK](#computer-sdk), you can: - automate Windows, Linux, and macOS VMs with a consistent, [pyautogui-like API](https://docs.trycua.com/docs/libraries/computer#interface-actions) - create & manage VMs [locally](https://docs.trycua.com/docs/computer-sdk/computers#cua-local-containers) or using [Cua cloud](https://www.trycua.com/) -With the Agent SDK, you can: +With the [Agent SDK](#agent-sdk), you can: - run computer-use models with a [consistent schema](https://docs.trycua.com/docs/agent-sdk/message-format) - benchmark on OSWorld-Verified, SheetBench-V2, and more [with a single line of code using HUD](https://docs.trycua.com/docs/agent-sdk/integrations/hud) ([Notebook](https://github.com/trycua/cua/blob/main/notebooks/eval_osworld.ipynb)) @@ -39,7 +39,7 @@ With the Agent SDK, you can: - [Get started with the Computer-Use Agent CLI](https://docs.trycua.com/docs/quickstart-cli) - [Get started with the Python SDKs](https://docs.trycua.com/docs/quickstart-devs) -# Agent Usage +# Agent SDK Install the agent SDK: @@ -182,7 +182,9 @@ The following table shows which capabilities are supported by each model: Missing a model? Create a [feature request](https://github.com/trycua/cua/issues/new?assignees=&labels=enhancement&projects=&title=%5BAgent%5D%3A+Add+model+support+for+) or [contribute](https://github.com/trycua/cua/blob/main/CONTRIBUTING.md)! -# Computer Usage +Learn more in the [Agent SDK documentation](./libs/python/agent/README.md). + +# Computer SDK Install the computer SDK: @@ -202,25 +204,85 @@ computer = Computer( api_key="your-api-key" # only for cloud # or use_host_computer_server=True for host desktop ) + +try: + await computer.run() + + # Take a screenshot screenshot = await computer.interface.screenshot() # Click and type await computer.interface.left_click(100, 100) await computer.interface.type("Hello!") +finally: + await computer.close() ``` +Learn more in the [Computer SDK documentation](./libs/python/computer/README.md). + +# MCP Server + +Install the MCP server: + +```bash +pip install cua-mcp-server +``` + +Learn more in the [MCP Server documentation](./libs/python/mcp-server/README.md). + +# Computer Server + +Install the Computer Server: + +```bash +pip install cua-computer-server +python -m computer_server +``` + +Learn more in the [Computer Server documentation](./libs/python/computer-server/README.md). + +# Lume + +Install Lume: + +```bash +curl -fsSL https://raw.githubusercontent.com/trycua/cua/main/libs/lume/scripts/install.sh | bash +``` + +Learn more in the [Lume documentation](./libs/lume/README.md). + +# Lumier + +Install Lumier: + +```bash +docker pull trycua/lumier:latest +``` + +Learn more in the [Lumier documentation](./libs/lumier/README.md). + +# SOM + +Install SOM: + +```bash +pip install cua-som +``` + +Learn more in the [SOM documentation](./libs/python/som/README.md). + # Modules -| Module | Description | Installation | -| ---------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------- | --------------------------------------------------------------------------------------------------- | -| [**Lume**](./libs/lume/README.md) | VM management for macOS/Linux using Apple's Virtualization.Framework | `curl -fsSL https://raw.githubusercontent.com/trycua/cua/main/libs/lume/scripts/install.sh \| bash` | -| [**Lumier**](./libs/lumier/README.md) | Docker interface for macOS and Linux VMs | `docker pull trycua/lumier:latest` | -| **Computer ([Python](./libs/python/computer/README.md)/[TS](./libs/typescript/computer/README.md))** | Interface for controlling virtual machines | `pip install "cua-computer[all]"`
`npm install @trycua/computer` | -| [**Agent**](./libs/python/agent/README.md) | AI agent framework for automating tasks | `pip install "cua-agent[all]"` | -| [**MCP Server**](./libs/python/mcp-server/README.md) | MCP server for using CUA with Claude Desktop | `pip install cua-mcp-server` | -| [**SOM**](./libs/python/som/README.md) | Self-of-Mark library for Agent | `pip install cua-som` | -| [**Computer Server**](./libs/python/computer-server/README.md) | Server component for Computer | `pip install cua-computer-server` | -| **Core ([Python](./libs/python/core/README.md)/[TS](./libs/typescript/core/README.md))** | Core utilities | `pip install cua-core`
`npm install @trycua/core` | +| Module | Description | +| ---------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------- | +| [**Agent**](./libs/python/agent/README.md) | AI agent framework for automating tasks | +| **Computer ([Python](./libs/python/computer/README.md)/[TS](./libs/typescript/computer/README.md))** | Interface for controlling virtual machines | +| [**MCP Server**](./libs/python/mcp-server/README.md) | MCP server for using CUA with Claude Desktop | +| [**Computer Server**](./libs/python/computer-server/README.md) | Server component for Computer | +| [**Lume**](./libs/lume/README.md) | VM management for macOS/Linux using Apple's Virtualization.Framework | +| [**Lumier**](./libs/lumier/README.md) | Docker interface for macOS and Linux VMs | +| [**SOM**](./libs/python/som/README.md) | Set-of-Mark library for Agent | +| **Core ([Python](./libs/python/core/README.md)/[TS](./libs/typescript/core/README.md))** | Core utilities | # Resources From 73fc9ac2bd99bba36fad60e323e089407b79e461 Mon Sep 17 00:00:00 2001 From: James Murdza Date: Fri, 24 Oct 2025 12:01:21 -0700 Subject: [PATCH 3/5] Add module grid to the top of the README --- README.md | 73 +++++++++++++++++++++++++++++++++++++++++++++---------- 1 file changed, 60 insertions(+), 13 deletions(-) diff --git a/README.md b/README.md index e9b5da52..ae008822 100644 --- a/README.md +++ b/README.md @@ -39,6 +39,66 @@ With the [Agent SDK](#agent-sdk), you can: - [Get started with the Computer-Use Agent CLI](https://docs.trycua.com/docs/quickstart-cli) - [Get started with the Python SDKs](https://docs.trycua.com/docs/quickstart-devs) +# Modules + + + + + + + + +
+ +[**Agent**](#agent-sdk)
+AI agent framework for automating tasks + +
+ +**[Computer](#computer-sdk)**
+Interface for controlling virtual machines + +
+ +**[MCP Server](#mcp-server)**
+MCP server for using CUA with Claude Desktop + +
+ +**[Computer Server](#computer-server)**
+Server component for Computer + +
+ + + + + + + + +
+ +**[Lume](#lume)**
+VM management for macOS/Linux using Apple's Virtualization.Framework + +
+ +**[Lumier](#lumier)**
+Docker interface for macOS and Linux VMs + +
+ +**[SOM](#som)**
+Set-of-Mark library for Agent + +
+ +**[Core](#core)**
+Core utilities + +
+ # Agent SDK Install the agent SDK: @@ -271,19 +331,6 @@ pip install cua-som Learn more in the [SOM documentation](./libs/python/som/README.md). -# Modules - -| Module | Description | -| ---------------------------------------------------------------------------------------------------- | -------------------------------------------------------------------- | -| [**Agent**](./libs/python/agent/README.md) | AI agent framework for automating tasks | -| **Computer ([Python](./libs/python/computer/README.md)/[TS](./libs/typescript/computer/README.md))** | Interface for controlling virtual machines | -| [**MCP Server**](./libs/python/mcp-server/README.md) | MCP server for using CUA with Claude Desktop | -| [**Computer Server**](./libs/python/computer-server/README.md) | Server component for Computer | -| [**Lume**](./libs/lume/README.md) | VM management for macOS/Linux using Apple's Virtualization.Framework | -| [**Lumier**](./libs/lumier/README.md) | Docker interface for macOS and Linux VMs | -| [**SOM**](./libs/python/som/README.md) | Set-of-Mark library for Agent | -| **Core ([Python](./libs/python/core/README.md)/[TS](./libs/typescript/core/README.md))** | Core utilities | - # Resources - [Cua Blog](https://www.trycua.com/blog) From b99e2f159851911adb91b04e56df275643a6011e Mon Sep 17 00:00:00 2001 From: James Murdza Date: Fri, 24 Oct 2025 12:03:56 -0700 Subject: [PATCH 4/5] Update module descriptions --- README.md | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index ae008822..6a2ddc78 100644 --- a/README.md +++ b/README.md @@ -52,19 +52,19 @@ AI agent framework for automating tasks **[Computer](#computer-sdk)**
-Interface for controlling virtual machines +TypeScript/Python SDK for controlling Cua environments **[MCP Server](#mcp-server)**
-MCP server for using CUA with Claude Desktop +MCP server for using Cua agents and computers **[Computer Server](#computer-server)**
-Server component for Computer +Server component that runs on Cua environments @@ -75,13 +75,13 @@ Server component for Computer **[Lume](#lume)**
-VM management for macOS/Linux using Apple's Virtualization.Framework +VM management for macOS **[Lumier](#lumier)**
-Docker interface for macOS and Linux VMs +Docker interface for macOS/Linux VMs @@ -93,7 +93,7 @@ Set-of-Mark library for Agent **[Core](#core)**
-Core utilities +Core utilities for Cua From 43934bb22dbf08675f584f2238aec528eda3e501 Mon Sep 17 00:00:00 2001 From: James Murdza Date: Fri, 24 Oct 2025 12:08:35 -0700 Subject: [PATCH 5/5] Update quick start section --- README.md | 12 ++++++------ 1 file changed, 6 insertions(+), 6 deletions(-) diff --git a/README.md b/README.md index 6a2ddc78..2a43f3b7 100644 --- a/README.md +++ b/README.md @@ -33,12 +33,6 @@ With the [Agent SDK](#agent-sdk), you can: - use new UI agent models and UI grounding models from the Model Zoo below with just a model string (e.g., `ComputerAgent(model="openai/computer-use-preview")`) - use API or local inference by changing a prefix (e.g., `openai/`, `openrouter/`, `ollama/`, `huggingface-local/`, `mlx/`, [etc.](https://docs.litellm.ai/docs/providers)) -# Quick Start - -- [Clone a starter template and run the code in <1 min](https://github.com/trycua/agent-template) (⭐️ Recommended!) -- [Get started with the Computer-Use Agent CLI](https://docs.trycua.com/docs/quickstart-cli) -- [Get started with the Python SDKs](https://docs.trycua.com/docs/quickstart-devs) - # Modules @@ -99,6 +93,12 @@ Core utilities for Cua
+# Quick Start + +- [Clone a starter template and run the code in <1 min](https://github.com/trycua/agent-template) +- [Get started with the Cua SDKs](https://docs.trycua.com/docs/quickstart-devs) +- [Get started with the Cua CLI](https://docs.trycua.com/docs/quickstart-cli) + # Agent SDK Install the agent SDK: