← Back to projects

Samvaad

Voice-first learning/information-gathering platform. Allows users to upload their docs and have text or voice conversations with them.

Samvaad

Problem Statement

Users struggle to quickly extract insights from large documents without reading everything. Traditional search is slow and voice-based interaction is limited.

Solution

Built a conversational AI that ingests documents and allows natural language queries via text or voice, using RAG for accurate responses.

Architecture

Frontend in Next.js handles UI and voice input/output. Backend uses FastAPI with OpenAI for embeddings and generation, Postgres for vector storage.

Problems Faced

Integrating ASR/TTS reliably, optimizing vector search for large docs, handling privacy concerns with user data.

What I Learned

Deepened knowledge in LLM fine-tuning, real-time audio processing, and scalable data pipelines.

Future Vision

Expand to multi-modal inputs (screenshots/videos), integrate with productivity tools, and add collaborative features.