From 0d442ac606fbdf09851c05e5d8cd1f91417264ca Mon Sep 17 00:00:00 2001 From: f-trycua Date: Sat, 5 Apr 2025 09:51:37 -0700 Subject: [PATCH] Update Agent README with Ollama provider --- libs/agent/README.md | 8 +++++++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/libs/agent/README.md b/libs/agent/README.md index c9bd5f42..b0fc41e9 100644 --- a/libs/agent/README.md +++ b/libs/agent/README.md @@ -43,6 +43,12 @@ async with Computer() as macos_computer: computer=macos_computer, loop=AgentLoop.OPENAI, model=LLM(provider=LLMProvider.OPENAI) + # or + # loop=AgentLoop.ANTHROPIC, + # model=LLM(provider=LLMProvider.ANTHROPIC) + # or + # loop=AgentLoop.OMNI, + # model=LLM(provider=LLMProvider.OLLAMA, model="gemma3") ) tasks = [ @@ -74,7 +80,7 @@ The `cua-agent` package provides three agent loops variations, based on differen |:-----------|:-----------------|:------------|:-------------| | `AgentLoop.OPENAI` | • `computer_use_preview` | Use OpenAI Operator CUA model | Not Required | | `AgentLoop.ANTHROPIC` | • `claude-3-5-sonnet-20240620`
• `claude-3-7-sonnet-20250219` | Use Anthropic Computer-Use | Not Required | -| `AgentLoop.OMNI`
(experimental) | • `claude-3-5-sonnet-20240620`
• `claude-3-7-sonnet-20250219`
• `gpt-4.5-preview`
• `gpt-4o`
• `gpt-4` | Use OmniParser for element pixel-detection (SoM) and any VLMs for UI Grounding and Reasoning | OmniParser | +| `AgentLoop.OMNI` | • `claude-3-5-sonnet-20240620`
• `claude-3-7-sonnet-20250219`
• `gpt-4.5-preview`
• `gpt-4o`
• `gpt-4`
• `phi4`
• `phi4-mini`
• `gemma3`
• `...`
• `Any Ollama-compatible model` | Use OmniParser for element pixel-detection (SoM) and any VLMs for UI Grounding and Reasoning | OmniParser | ## AgentResponse The `AgentResponse` class represents the structured output returned after each agent turn. It contains the agent's response, reasoning, tool usage, and other metadata. The response format aligns with the new [OpenAI Agent SDK specification](https://platform.openai.com/docs/api-reference/responses) for better consistency across different agent loops.