1047 Commits

Author SHA1 Message Date
Dillon DuPont
f45f6b84e9 added gpt-5 + gpta1 examples 2025-08-08 18:36:01 -04:00
Dillon DuPont
9685833428 updated docs 2025-08-08 18:32:39 -04:00
Dillon DuPont
5495529462 limited tasks in notebook 2025-08-08 18:26:44 -04:00
Dillon DuPont
ae128a2ae4 added run_job's kwargs 2025-08-08 18:21:48 -04:00
Dillon DuPont
8f15c21df9 added run_job 2025-08-08 18:15:56 -04:00
Dillon DuPont
f819c578b7 Add example notebook 2025-08-08 13:14:56 -04:00
Dillon DuPont
1882fb68e5 Added HUD integration 2025-08-08 12:47:07 -04:00
Dillon DuPont
5c46ca0c9a removed max_its 2025-08-08 12:40:50 -04:00
Dillon DuPont
b9f307a149 Added HUD integration 2025-08-08 12:17:35 -04:00
Dillon DuPont
6ebf06e2fa Documented custom computers 2025-08-08 11:14:30 -04:00
Dillon DuPont
f78d026060 renamed ComputerHandler to AsyncComputerHandler 2025-08-08 11:09:44 -04:00
Dillon DuPont
feec92a1d6 added support for sync methods 2025-08-08 11:06:43 -04:00
Dillon DuPont
73cd489ac3 Added dict-based custom computers 2025-08-08 10:53:26 -04:00
Dillon DuPont
d47ef3f1b6 Added protocol for custom computer handlers 2025-08-08 10:22:10 -04:00
ddupont
a78a0e1e58 Merge pull request #333 from trycua/fix/passthrough-tool-errors
[Agent] Implement left_mouse_down, left_mouse_up, and tool errors
2025-08-07 16:49:46 -04:00
Dillon DuPont
4b0b07240c Made coordinates optional for left_mouse_up/down (tag: agent-v0.4.11) 2025-08-07 16:47:34 -04:00
Dillon DuPont
1b406b197a Updated anthropic loop to include function_call and function_call_output, along with passing tool exceptions to the agent 2025-08-07 16:39:40 -04:00
ddupont
889d32dc1f Merge pull request #331 from trycua/fix/failsafes
[Computer-Server] Disable pyautogui FAILSAFE
2025-08-06 14:49:21 -04:00
Dillon DuPont
0056ae44f5 Disabled pyautogui FAILSAFE (tag: computer-server-v0.1.21) 2025-08-06 14:47:29 -04:00
Dillon DuPont
4eccf059e5 Added omniparser to grounding page 2025-08-06 11:54:36 -04:00
Dillon DuPont
760faf1b55 Added Claude 4.1 to docs 2025-08-06 11:46:18 -04:00
Dillon DuPont
8e249174cd Moved hotkey splitting into keypress handler (tag: agent-v0.4.10) 2025-08-06 10:53:51 -04:00
Dillon DuPont
3631caa883 Changed keypress to cast strings to lists (tag: agent-v0.4.9) 2025-08-06 10:27:59 -04:00
ddupont
63afd4ac16 Merge pull request #330 from trycua/feat/extra-models
[Agent] Add support for UI grounding models (GTA1) and composed grounding-planner agents (Agent S)
(tag: agent-v0.4.8)
2025-08-05 13:28:34 -04:00
Dillon DuPont
a6a60b9fe0 added claude 4.1 to docs 2025-08-05 13:24:29 -04:00
Dillon DuPont
5168b6f082 added docs for benchmarks and composed agents 2025-08-05 13:02:45 -04:00
Dillon DuPont
74a25f2003 fixed docstring 2025-08-05 12:54:23 -04:00
Dillon DuPont
0063eccb79 Merge branch 'main' into feat/extra-models 2025-08-05 12:46:26 -04:00
Dillon DuPont
8eb662bf4d added base models to benchmark 2025-08-05 12:45:00 -04:00
Dillon DuPont
3cc36905ff added missing abstract mthd 2025-08-05 12:36:22 -04:00
Dillon DuPont
e1e6a7d6a7 Added anthropic click mode 2025-08-05 12:32:42 -04:00
Dillon DuPont
dc0e479170 changed comment 2025-08-05 11:25:29 -04:00
Dillon DuPont
5e7b1e46b0 fixed resolution & OS 2025-08-05 11:23:08 -04:00
ddupont
5456daba56 Merge pull request #329 from trycua/feat/agentui-localhost
[Agent] Add localhost to the agent UI
(tag: agent-v0.4.7)
2025-08-05 10:56:54 -04:00
Dillon DuPont
075e48141d Merge branch 'main' into feat/agentui-localhost (tag: computer-v0.4.1) 2025-08-05 10:56:26 -04:00
Dillon DuPont
01371ed6cf bump version, add localhost to ui 2025-08-05 10:55:06 -04:00
ddupont
2f6d3f9588 Merge pull request #328 from trycua/feat/agentui-localhost
[Agent, MCP] Update mcp server to cua-agent==0.4.x, add localhost to UI
2025-08-05 10:54:16 -04:00
Dillon DuPont
96e4b7586a update mcp server to cua-agent==0.4.x 2025-08-05 10:51:21 -04:00
Dillon DuPont
3e7bc0aa79 Add omniparser predict_click 2025-08-04 18:11:21 -04:00
ddupont
8bd6249062 Merge pull request #326 from ggcr/main
Some NITs
2025-08-04 17:46:58 -04:00
ddupont
583b0ce85c Merge pull request #327 from jamesmurdza/fix/broken-import-in-agent-ui-example
Fix broken import in Agent UI example
(tag: agent-v0.4.6)
2025-08-04 17:22:25 -04:00
James Murdza
d27ee728b5 Fix broken import after refactor in 5bfadf8f9a 2025-08-04 17:02:11 -04:00
Dillon DuPont
f87b8eaea5 added grounding+planning composed loop 2025-08-04 16:32:05 -04:00
cgutierr
77922c92fc Dummy (small) models can produce action_types=None, which will later produce a crash, avoid it 2025-08-03 13:04:29 +02:00
cgutierr
9781810451 Move inputs to model's same device, previously hardcoded to CUDA 2025-08-03 13:02:15 +02:00
cgutierr
5b5f17af8e Use generic AutoModel not tied to specific arch 2025-08-03 13:00:27 +02:00
cgutierr
b2df58a91e Fix quickstart script 2025-08-03 12:54:10 +02:00
Dillon DuPont
d5564977f0 working gta1 loop 2025-08-01 15:49:19 -04:00
ddupont
5bfadf8f9a Update pii_anonymization.py 2025-08-01 11:46:32 -04:00
Morgan Dean
10bac17c78 Merge pull request #316 from trycua/feat/docs/init
[DRAFT] Docs Website
2025-07-31 10:57:24 -07:00