ai agent demo
Basic Information
This repository is a demo playground for building and experimenting with voice-enabled AI agents that combine large language models with real-time speech-to-speech processing. It demonstrates an end-to-end stack: a React + TypeScript frontend and a Python FastAPI backend, with Google Gemini Live providing the model capabilities and VideoSDK providing real-time audio/video transport and conferencing. The examples cover agent lifecycle management, real-time audio pipelines, and integration with the Gemini Realtime API for conversational voice agents. The README covers prerequisites, server and client setup, environment configuration, and sample API usage for running and testing agents locally or through a tunnel. The project is intended for prototyping interactive assistants, automated calling flows, and voice interfaces, and it showcases the available configuration options and core components.
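
As an illustration of the environment configuration step mentioned above, the backend might validate its credentials at startup along these lines. This is a minimal sketch, not the repository's actual code: the variable names `GOOGLE_API_KEY` and `VIDEOSDK_AUTH_TOKEN`, the `Settings` class, and the `load_settings` helper are all assumptions chosen for the example, so check the project's own README for the real names.

```python
import os
from dataclasses import dataclass
from typing import Mapping, Optional


@dataclass(frozen=True)
class Settings:
    """Credentials the backend needs (field names are hypothetical)."""
    google_api_key: str
    videosdk_auth_token: str


def load_settings(env: Optional[Mapping[str, str]] = None) -> Settings:
    """Read required variables from `env` (defaults to os.environ).

    Raises RuntimeError listing every missing or empty variable, so a
    misconfigured server fails at startup instead of mid-call.
    """
    source = os.environ if env is None else env
    # Assumed variable names; the real project may use different ones.
    required = ("GOOGLE_API_KEY", "VIDEOSDK_AUTH_TOKEN")
    missing = [name for name in required if not source.get(name)]
    if missing:
        raise RuntimeError(
            "Missing environment variables: " + ", ".join(missing)
        )
    return Settings(
        google_api_key=source["GOOGLE_API_KEY"],
        videosdk_auth_token=source["VIDEOSDK_AUTH_TOKEN"],
    )
```

Passing the environment in as a mapping keeps the helper easy to test without touching the real process environment; in the server it would be called once with no arguments before any agent is started.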