blast
Basic Information
BLAST is a high-performance serving engine designed to add web browsing capabilities to AI applications. It exposes an OpenAI-compatible API so existing clients can send requests and receive streaming, browser-augmented LLM outputs. The project focuses on efficient resource management for interactive web-based agents by providing built-in concurrency, automatic parallelism, and caching to reduce latency and costs. BLAST can be run locally or deployed to serve multiple users while keeping memory and budget constraints under control. Typical use cases described include embedding web browsing AI into apps, automating web-based workflows, and local development and experimentation. The repository provides a pip-installable package and a serve command to run a local server and example code showing how to stream browser actions through a compatible OpenAI client.