Commit Graph

91 Commits

Author SHA1 Message Date
James Murdza
635480f3b7 Remove PyLume code and examples
# Conflicts:
#	examples/pylume_examples.py
#	libs/python/mcp-server/scripts/start_mcp_server.sh
#	libs/python/pylume/README.md
#	libs/python/pylume/__init__.py
#	libs/python/pylume/pylume/__init__.py
#	libs/python/pylume/pylume/client.py
#	libs/python/pylume/pylume/exceptions.py
#	libs/python/pylume/pylume/models.py
#	libs/python/pylume/pylume/pylume.py
#	libs/python/pylume/pylume/server.py
#	libs/python/pylume/pyproject.toml
#	pdm.lock
#	pyproject.toml
#	scripts/build-uv.sh
#	scripts/build.ps1
#	scripts/build.sh
2025-10-28 16:52:23 -07:00
James Murdza
ddc5a5de91 Format codebase with uv run pre-commit run --all-files 2025-10-22 11:35:31 -07:00
f-trycua
9d8dd768bf update container to sandbox 2025-10-08 18:14:46 -07:00
bowman
b0da328323 switch demo to gpt-5 in eval_osworld.ipynb 2025-10-06 20:27:01 -07:00
Dillon DuPont
297801fecb fixed wording 2025-09-18 10:55:39 -04:00
Dillon DuPont
e7a20fe4c2 latest nb 2025-09-18 10:53:00 -04:00
Dillon DuPont
13fc9e2e90 updated nb 2025-09-18 10:51:16 -04:00
Dillon DuPont
af4a328cec latest notebook 2025-09-18 10:42:48 -04:00
Dillon DuPont
5928c41dc0 Rewrote notebook to be easier to follow 2025-09-18 10:39:19 -04:00
Dillon DuPont
b9185ad54e Merge branch 'main' into integrations/ollama 2025-09-16 19:09:25 -04:00
Dillon DuPont
ca564b2436 Merge branch 'main' into models/opencua 2025-09-15 15:11:15 -04:00
James Murdza
1c79a3a5a2 Add cloud version of hackathon notebook 2025-09-13 04:44:23 -04:00
James Murdza
deb2132aef Fix hackathon notebook 2025-09-13 03:20:08 -04:00
James Murdza
993d52527f Rename hackathon notebook 2025-09-13 01:50:01 -04:00
James Murdza
981a081672 Improve notebook structure 2025-09-13 01:42:06 -04:00
James Murdza
8938b37ca7 Change hackathon notebook to use Docker 2025-09-13 01:42:05 -04:00
James Murdza
77d91ef6e1 Clarify instructions in hackathon notebook 2025-09-13 00:57:14 -04:00
James Murdza
4ec4bbc888 Add link to HUD integration documentation 2025-09-12 21:41:14 -04:00
James Murdza
3552ef62a8 Add relevant links to docs 2025-09-12 18:41:38 -04:00
James Murdza
73fb0002f0 Improve notebook structure 2025-09-12 18:36:56 -04:00
James Murdza
4c52aaa298 Assert that HUD_API_KEY is set 2025-09-12 18:27:34 -04:00
James Murdza
b72d8da8a7 Add Prequisites section 2025-09-12 18:13:16 -04:00
James Murdza
ea1caea73c Reuse agent configuration for HUD evaluation 2025-09-12 18:00:27 -04:00
James Murdza
48e42d2334 Only load .env file in notebook directory 2025-09-12 18:00:27 -04:00
Dillon DuPont
cf95646503 Merge branch 'main' into models/opencua 2025-09-12 13:30:21 -04:00
Dillon DuPont
492ebe9f0e Added simple ollama agent example 2025-09-12 13:02:23 -04:00
James Murdza
4b3e2077fb Remove dataset size limit during HUD evaluation 2025-09-12 12:55:43 -04:00
James Murdza
68ecdcc99a Assert Cua API keys exist in notebook 2025-09-12 12:55:22 -04:00
James Murdza
1aca043006 Automatically create .env file in notebook 2025-09-12 12:55:11 -04:00
James Murdza
28f206d824 Improve explanatory text in notebook 2025-09-12 12:54:50 -04:00
James Murdza
4dedd06c5b Improve notebook structure 2025-09-12 12:39:37 -04:00
Dillon DuPont
665e65cb85 Replaced computer shim with Docker computer 2025-09-09 11:00:52 -04:00
Dillon DuPont
f270af30e1 added notebook 2025-09-09 10:57:16 -04:00
James Murdza
796835b9e5 Add trajectory viewer and VNC instructions to notebook 2025-09-08 09:53:42 -04:00
James Murdza
64b555bb34 Update notebook to use OSWorld-Tiny dataset 2025-09-08 09:53:42 -04:00
James Murdza
cd59a63a49 Fix URL in example notebook 2025-09-08 09:53:42 -04:00
James Murdza
c5ca6e9e9f Add Jupyter notebook for the SOTA challenge 2025-09-05 07:43:45 -04:00
James Murdza
1882f099a2 Change HUD dataset name from OSWorld-Verified-XLang to OSWorld-Verified 2025-09-03 11:18:43 -04:00
James Murdza
c820d5124d Load environment variables in HUD notebook 2025-09-02 16:10:13 -04:00
James Murdza
33ce7515a5 Clear HUD notebook outputs 2025-09-02 16:08:24 -04:00
Dillon DuPont
c4ce791a49 Update OSWorld output 2025-08-28 12:07:56 -04:00
ddupont
311bbf9709 Merge pull request #371 from trycua/chore/hud-upgrade
[Agent] Upgrade HUD SDK to 0.4.12
2025-08-28 11:29:18 -04:00
Dillon DuPont
95cefc50f0 added extended kwargs, renamed callback to normalizer 2025-08-27 20:49:31 -04:00
Dillon DuPont
0d3f8ea3ff Improved trajectory saving 2025-08-27 16:48:57 -04:00
Dillon DuPont
e8eaf66e2a Added latest nb 2025-08-27 13:38:55 -04:00
Dillon DuPont
3c502354a8 added simple task id 2025-08-27 13:28:24 -04:00
Dillon DuPont
61a442da56 fixed getattr crash 2025-08-27 13:21:46 -04:00
James Murdza
afe01ff831 Add missing comma in example code 2025-08-27 12:53:55 -04:00
Dillon DuPont
84e2a27aea added notebook 2025-08-26 18:29:39 -04:00
f-trycua
a6406ae179 Update notebooks for KASM Docker 2025-08-26 13:05:35 +00:00