GuideIntegrationsBenchmarks

OSWorld-Verified

Benchmark ComputerAgent on OSWorld tasks using HUD

OSWorld-Verified is a curated subset of OSWorld tasks that can be run using the HUD framework.

Use ComputerAgent with HUD to benchmark on these tasks.

Was this page helpful?


On this page

No Headings