Commit Graph

162 Commits

Author SHA1 Message Date
bowman
1fe41d57f4 update hud in agent pyproject.toml 2025-10-06 20:24:21 -07:00
Dillon DuPont
1e94b5d8b4 Added working moondream3 agent 2025-10-02 11:07:11 -04:00
Dillon DuPont
0b3c677205 added moondream3 agent loop 2025-10-02 10:57:06 -04:00
Dillon DuPont
c892f4ecea Change anthropic predict_click logic 2025-09-25 15:53:20 -07:00
Dillon DuPont
1346feb125 Add postponed annotations to internvl.py 2025-09-19 19:23:13 -04:00
Dillon DuPont
821bd03e48 remove extra prints 2025-09-18 11:27:50 -04:00
Dillon DuPont
03d7806549 Fixed invalid trajectory names on ollama 2025-09-18 10:52:45 -04:00
Dillon DuPont
6ddddf8f88 fix internVL inference 2025-09-16 12:56:07 -04:00
Dillon DuPont
9147e8eeaf Added "cua-agent[internvl-hf]" dep 2025-09-16 12:02:07 -04:00
Dillon DuPont
c5bbd4611a add qwen2_5_vl.py 2025-09-15 16:29:26 -04:00
Dillon DuPont
7a7de5d50f add holo models 2025-09-15 16:10:54 -04:00
Dillon DuPont
ca564b2436 Merge branch 'main' into models/opencua 2025-09-15 15:11:15 -04:00
James Murdza
b4b45e5b8b Upgrade HUD SDK to 0.4.26 2025-09-14 00:34:51 -04:00
James Murdza
8096fbfd34 Upgrade HUD SDK to 0.4.25 2025-09-13 22:43:12 -04:00
Dillon DuPont
c58ff55969 Added agent tool filtering 2025-09-12 20:12:29 -04:00
Dillon DuPont
fc0f10aaf9 Ignore extra computers when running evals 2025-09-12 14:50:18 -04:00
Dillon DuPont
ae22572658 Merge branch 'main' of https://github.com/trycua/cua 2025-09-12 13:58:22 -04:00
Dillon DuPont
fad293957d Pin HUD version 2025-09-12 13:57:28 -04:00
Dillon DuPont
cf95646503 Merge branch 'main' into models/opencua 2025-09-12 13:30:21 -04:00
Dillon DuPont
58807378dd Added internVL 2025-09-12 13:30:09 -04:00
Dillon DuPont
faf531825e Fixed error during response call 2025-09-12 12:32:03 -04:00
Dillon DuPont
b3040306b8 Fixing bugs 2025-09-12 12:06:36 -04:00
Dillon DuPont
b69943121d Fixed KeyError 2025-09-12 11:29:40 -04:00
Dillon DuPont
f795660f75 Upgraded HUD impl. to support custom tools 2025-09-12 11:14:03 -04:00
Dillon DuPont
2f28c3a2ce Added missing file 2025-09-10 14:57:44 -04:00
Dillon DuPont
17d6709629 added simple guide for customizing computeragent 2025-09-09 10:55:57 -04:00
Dillon DuPont
2dfaf0047d Fix multimodal user inputs in the anthropic loop 2025-09-04 16:15:33 -04:00
Dillon DuPont
957fef788d Fix X/Y scrolling on linux 2025-09-04 09:52:59 -04:00
James Murdza
e17f6106c8 Move text from README to Cua documentation 2025-09-01 09:30:06 -04:00
Dillon DuPont
39d60a852c Updated license of cua-som and cua-agent[omni] 2025-08-29 11:33:04 -04:00
Dillon DuPont
6ec083e28a Removed reasoning pass 2025-08-28 19:21:04 -04:00
Dillon DuPont
efc2c3e54c Fixed KeyError 2025-08-28 18:24:20 -04:00
Dillon DuPont
ddd01ee719 Improved image retention callback 2025-08-28 18:18:40 -04:00
Dillon DuPont
e61fbeda5e Made labels more descriptive 2025-08-28 17:36:56 -04:00
Dillon DuPont
2cf6290c47 Made UI more compact 2025-08-28 17:26:09 -04:00
Dillon DuPont
e2fac486ee Renamed Response to Message 2025-08-28 17:19:36 -04:00
Dillon DuPont
022f999259 Simplified UI 2025-08-28 17:12:26 -04:00
Dillon DuPont
2ba67d399d Added wait action 2025-08-28 17:09:35 -04:00
Dillon DuPont
038ad4df10 Force default model for human-in-the-loop 2025-08-28 16:55:28 -04:00
Dillon DuPont
2216043305 Fixed bug in init 2025-08-28 16:50:38 -04:00
Dillon DuPont
b7cbac31a0 Added grounding model fallback 2025-08-28 16:39:43 -04:00
Dillon DuPont
e90997c4ff Added screenshot_dir and lazy loading of MLX 2025-08-28 13:18:17 -04:00
ddupont
311bbf9709 Merge pull request #371 from trycua/chore/hud-upgrade [Agent] Upgrade HUD SDK to 0.4.12 2025-08-28 11:29:18 -04:00
Dillon DuPont
5fafe861ef added implicit scroll action 2025-08-27 22:59:26 -04:00
Dillon DuPont
4c678f0f4e normalize { 'click': 'left', ... } hallucination 2025-08-27 22:19:37 -04:00
Dillon DuPont
e8210389a5 added kwarg filter 2025-08-27 21:39:44 -04:00
Dillon DuPont
063bf223d1 added kwarg filter 2025-08-27 21:35:18 -04:00
Dillon DuPont
c003d7ec7b added more normalizer cases 2025-08-27 21:27:58 -04:00
Dillon DuPont
15301935c0 renamed agent.py to proxy 2025-08-27 21:18:06 -04:00
Dillon DuPont
95cefc50f0 added extended kwargs, renamed callback to normalizer 2025-08-27 20:49:31 -04:00