Engineered a production-style Retrieval-Augmented Generation (RAG) system for syllabus-grounded question answering, supporting multimodal ingestion (PDF, PPT, video transcripts) and semantic retrieval using FAISS and Qdrant.
Built modular RAG pipelines with LlamaIndex and MistralAI, with optional Kafka-backed streaming. Implemented a fully local RAG evaluation framework (470+ queries) using TF-IDF and cosine similarity to measure context precision, recall, and answer correctness. Real-time chat architecture via WebSockets and Redis session management.