Production-grade documentation RAG system with multi-source ingestion, hybrid retrieval, freshness handling, tenant isolation, and MCP server, built with the depth and tradeoffs
> A production-grade, multi-tenant documentation RAG system with empirically validated retrieval, streaming answers, and an MCP server — built from first principles to match what real AI infrastructure companies ship. --- Most RAG tutorials stop at "chunk → embed → retrieve → generate." This project goes further — it's what a real company like [kapa.ai](https://kapa.ai) actually has to build: