ai00_server
Basic Information
AI00 RWKV Server is an inference API server implementation for the RWKV family of language models, built on the web-rwkv inference engine. It provides a compact, Rust-based server that exposes OpenAI ChatGPT-compatible API endpoints for chat completions, text completions and embeddings. The project targets GPU acceleration via Vulkan so it can run on AMD cards and integrated GPUs without CUDA or PyTorch. The repository supplies pre-built executables and instructions to build from source with Rust, a model conversion tool for converting PyTorch .pth models to safetensors .st, and configuration files for model paths and quantization. Typical use cases documented include chatbots, text generation, translation and Q&A, and it includes a WebUI served on port 65530 for interactive use.