Context-Aware RAG Chatbot
A chatbot that delivers citation-backed answers using Retrieval-Augmented Generation (RAG). The system ingests curated Wikipedia articles, applies semantic chunking to preserve topic boundaries, and retrieves context through hybrid search that combines vector retrieval using maximal marginal relevance (MMR) with BM25 keyword ranking. It uses LangGraph for structured conversation flow, AutoCut-style context distillation to reduce redundant evidence, SQLite-backed session persistence, and LangSmith tracing for observability. The backend is built with FastAPI, Pydantic, ChromaDB, HuggingFace embeddings, and Ollama-hosted LLMs; the frontend uses Next.js, TypeScript, and Tailwind. Docker Compose is supported for deployment.
RAG, LangGraph, Hybrid Search
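The hybrid search step described above can be illustrated with a minimal, dependency-free sketch. It is not the project's implementation: the function names `mmr` and `rrf` are hypothetical, the toy vectors stand in for real embeddings, and reciprocal rank fusion is only one common way to merge a vector ranking with a BM25 ranking (the description does not say how the two are combined).

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def mmr(query_vec, doc_vecs, k=3, lambda_mult=0.5):
    """Maximal marginal relevance: greedily pick k document indices.

    Each step picks the candidate that maximizes
    lambda_mult * relevance - (1 - lambda_mult) * redundancy,
    where redundancy is the highest similarity to any already-selected doc.
    """
    relevance = [cosine(query_vec, d) for d in doc_vecs]
    candidates = list(range(len(doc_vecs)))
    selected = []

    def score(i):
        redundancy = max(
            (cosine(doc_vecs[i], doc_vecs[j]) for j in selected),
            default=0.0,
        )
        return lambda_mult * relevance[i] - (1 - lambda_mult) * redundancy

    while candidates and len(selected) < k:
        best = max(candidates, key=score)
        selected.append(best)
        candidates.remove(best)
    return selected


def rrf(rankings, k=60):
    """Reciprocal rank fusion (assumed fusion method, not confirmed by the source).

    `rankings` is a list of ranked ID lists, e.g. one from the vector
    retriever and one from BM25. Each ID scores sum(1 / (k + rank)).
    """
    scores = {}
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] = scores.get(doc_id, 0.0) + 1.0 / (k + rank)
    # Sort by fused score, breaking ties by ID for a deterministic order.
    return sorted(scores, key=lambda d: (-scores[d], d))


if __name__ == "__main__":
    query = [1.0, 0.0]
    docs = [[1.0, 0.0], [0.99, 0.1], [0.6, 0.8]]  # docs 0 and 1 are near-duplicates
    print(mmr(query, docs, k=2, lambda_mult=0.3))
    print(rrf([["a", "b", "c"], ["c", "a", "d"]]))
```

A lower `lambda_mult` weights diversity more heavily, so near-duplicate chunks are less likely to be selected together; a value of 1.0 reduces MMR to plain similarity ranking. In a real deployment the vector store and a BM25 library would supply the rankings, and only the fusion step would resemble the sketch.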