Report Abuse

Basic Information

xLAM is a research repository that presents a family of Large Action Models and accompanying tooling to support agentic systems research. It aggregates and standardizes multi-turn agent trajectories from diverse environments into unified datasets and provides a generic data loader and training pipeline for agent training. The project includes pretrained and fine-tuned models optimized for multi-turn conversation and function-calling across a range of parameter scales, model naming conventions, and release artifacts on model hubs. The repository also contains APIGen-MT, an agentic pipeline for multi-turn data generation via simulated agent-human interplay, and ActionStudio, a lightweight framework for preparing agentic data and training large action models. The codebase includes examples, training configs, deployment instructions for Transformers and vLLM, benchmark results, licensing information, and notes that data and models are released for research purposes only.

Links

App Details

Features
Provides a suite of xLAM models with function-calling and multi-turn conversation capabilities across multiple parameter sizes and context lengths. Includes ActionStudio, a structured framework with unified trajectory datasets, data conversion utilities, example configs, and training scripts to support reproducible training. Ships APIGen-MT for synthesizing multi-turn function-calling data via simulated agent-human interactions. Offers deployment examples for Transformers and vLLM, including a vLLM tool-parser plugin and OpenAI-compatible serving examples. Contains benchmarking results on leaderboards and multiple benchmarks, automated training bookkeeping features like unified config tracking and DeepSpeed/Hugging Face parity fixes, and compatibility notes for FastChat and other inference frameworks. Licensing and dataset usage constraints are documented for research-only use.
Use Cases
The repository helps researchers and engineers build, fine-tune, evaluate, and deploy agent-capable language models by providing standardized multi-turn trajectory datasets and a unified data loader to reduce preprocessing friction. ActionStudio and APIGen-MT accelerate data creation, mixture configuration, and training workflows, while example training configurations and DeepSpeed/HF parity improvements increase reproducibility. Pretrained xLAM models and deployment recipes for Transformers and vLLM enable rapid prototyping of function-calling agents and OpenAI-compatible local serving. Benchmark results and model naming conventions clarify capabilities and intended use cases. Documentation of licenses and research-only release status guides appropriate usage and data sharing for academic and experimental projects.

Please fill the required fields*