bananalyzer
Basic Information
Bananalyzer is an open source evaluation framework and dataset for testing AI agents on web tasks using Playwright. It provides a CLI that runs structured evaluation examples defined in a examples.json file and serves historic static snapshots of pages as MHTML so tests remain reproducible despite site changes, latency, or anti-bot protections. Users plug in their agent by implementing an AgentRunner interface and exposing an agent instance, and the tool dynamically constructs pytest test suites to execute examples. The repo includes utilities and a notebook to capture pages, a basic FastAPI server to expose example data and API docs, a schema inspired by existing web datasets, and a roadmap to add multi-step interactions and translate other web evaluation datasets.