Report Abuse

Basic Information

syftr is an agent optimizer designed to find Pareto-optimal agentic and non-agentic workflows for a given budget and competing objectives. It lets users supply datasets and compose a search space of models and flow components, then runs an efficient multi-objective search to estimate a Pareto frontier that trades off accuracy against costs such as LLM token cost, latency, and throughput. syftr implements multi-objective Bayesian optimization together with a domain-specific Pareto Pruner to focus sampling on promising configurations. It functions as a library and a CLI with example studies and Jupyter notebooks, requires a config.yaml for LLM and infrastructure settings, and outputs discovered Pareto flows and associated metrics. The project integrates with distributed execution and is intended for researchers and engineers exploring cost-accuracy trade-offs in agentic pipelines.

Links

Categorization

App Details

Features
Supports multi-objective Bayesian optimization and a custom Pareto Pruner to efficiently search agentic flow configurations. Integrates Ray for distributed scaling and Optuna for flexible study definition and multi-objective algorithms. Provides LLM configuration with explicit cost and metadata fields, support for embedding models, and automatic inclusion of configured models into the default search space. Offers CLI and Python API usage patterns, example studies and Jupyter notebooks, and a study YAML format for reproducible experiments. Uses HuggingFace Datasets for dataset management, LlamaIndex for building RAG workflows, and Trace for textual component optimization. Configurable storage with SQLite by default and optional Postgres for distributed runs. Includes docs for adding custom datasets and flows and a sample config.yaml for required credentials and endpoints.
Use Cases
Helps practitioners and researchers discover efficient generative AI pipelines by automating exploration of model and component combinations under budget constraints. By producing a Pareto frontier of solutions, syftr makes trade-offs between accuracy and competing objectives explicit so teams can select workflows that match cost, latency, or throughput requirements. Its distributed execution and Optuna-backed search let users scale experiments across CPUs and GPUs and compare many candidate flows reproducibly. The configurable LLM cost modeling and provider-agnostic setup let organizations evaluate heterogeneous model deployments. Extensibility for custom datasets and flows enables application to different domains and datasets, and the CLI plus API support makes it usable both interactively and in automated pipelines.

Please fill the required fields*