reflexion
Basic Information
This repository contains the code, demonstration notebooks, and logged experiment outputs for the NeurIPS 2023 paper "Reflexion: Language Agents with Verbal Reinforcement Learning." It is organized to reproduce and explore experiments in three domains described in the paper: reasoning (HotPotQA), decision-making (AlfWorld), and programming. The materials include notebooks that run agent variants, shell scripts to launch iterative AlfWorld trials, and recorded runs and logs for prior experiments. Setup instructions show how to install required Python dependencies and configure an OpenAI API key. The project exposes configurable agent types and reflexion strategies and stores outputs in structured log directories so researchers and developers can inspect reasoning traces, self-reflections, and trial-level results without rerunning costly API experiments.