rl
Basic Information
TorchRL is an open-source, PyTorch-native library that provides primitives and higher-level tools to build, train and evaluate reinforcement learning agents and language-model fine-tuning pipelines. It is designed for developers and researchers to implement RL algorithms, manage environments and data, run distributed collectors, and perform modular experiment workflows. The project centers on a Python-first, modular design with a unified data structure called TensorDict that simplifies batched rollouts, replay buffers and training loops. TorchRL also includes an LLM API for supervised fine-tuning and RLHF workflows with conversation management and tool integration. The repository bundles example implementations, SOTA recipes, and utilities for both online and offline RL to accelerate reproducible research and production experimentation.