Repositories
Open-source benchmark for browser AI agents on daily tasks.
Python
agent-evaluation, agentic-ai, ai-agent-benchmark, ai-agents, benchmark, browser-agent, browser-automation, browser-use, chrome-agent, chrome-extension, computer-use, dataset, evaluation, everyday-tasks, llm, llm-evaluation, online-tasks, real-world-benchmark, web-agent, web-agents
A version of verl to support diverse tool use
Python
agent, learning, llm, reinforcement