Files
computer/libs/agent
2025-03-16 16:06:32 +01:00
..
2025-03-16 16:06:32 +01:00
2025-03-16 16:06:32 +01:00
2025-03-16 16:06:32 +01:00
2025-03-16 16:06:32 +01:00
2025-03-16 16:06:32 +01:00

Shows my svg

Python macOS Discord PyPI

Agent is a Computer Use (CUA) framework for running multi-app agentic workflows targeting macOS and Linux sandbox, supporting local (Ollama) and cloud model providers (OpenAI, Anthropic, Groq, DeepSeek, Qwen). The framework integrates with Microsoft's OmniParser for enhanced UI understanding and interaction.

Get started with Agent

from agent import ComputerAgent, AgentLoop, LLMProvider
from computer import Computer

computer = Computer(verbosity=logging.INFO)

agent = ComputerAgent(
    computer=computer,
    loop=AgentLoop.ANTHROPIC,
    # loop=AgentLoop.OMNI,
    model=LLM(provider=LLMProvider.ANTHROPIC, name="claude-3-7-sonnet-20250219"),
    # model=LLM(provider=LLMProvider.OPENAI, name="gpt-4.5-preview"),
    save_trajectory=True,
    trajectory_dir=str(Path("trajectories")),
    only_n_most_recent_images=3,
    verbosity=logging.INFO,
)

tasks = [
"""
Please help me with the following task:
1. Open Safari browser
2. Go to Wikipedia.org
3. Search for "Claude AI" 
4. Summarize the main points you find about Claude AI
"""
]

async with agent:
    for i, task in enumerate(tasks, 1):
        print(f"\nExecuting task {i}/{len(tasks)}: {task}")
        async for result in agent.run(task):
            print(result)
        print(f"Task {i} completed")

Install

cua-agent

pip install "cua-agent[all]"

# or install specific loop providers
pip install "cua-agent[anthropic]"
pip install "cua-agent[omni]"

Run

Refer to these notebooks for step-by-step guides on how to use the Computer-Use Agent (CUA):