mcp crawl4ai rag
Basic Information
This repository provides an MCP server that enables AI agents and AI coding assistants to crawl the web, index content into a vector database, and perform retrieval augmented generation workflows. It integrates Crawl4AI for web crawling, Supabase for vector storage and search, and optionally Neo4j for a knowledge graph used in hallucination detection and repository analysis. The server exposes MCP tools for single page crawling, smart site crawling, listing available sources, and running semantic RAG queries. It also includes optional agentic code extraction and code example search, cross-encoder reranking, and contextual embedding generation. The project is intended as a testbed and a building block to be integrated into a larger knowledge engine called Archon, and it supports running in Docker or directly with Python uv while using configuration flags to enable or disable advanced RAG strategies.