
Zambezi Voice RAG: A RAG System Starter Code
Zambezi Voice RAG is a hybrid search retrieval-augmented generation system built with Streamlit. It combines dense vector embeddings via mxbai-embed-large and sparse BM25 retrieval on a Pinecone index for accurate, context-aware answers. Users can query research documents conversationally, inspect retrieved chunks, and get streaming responses powered by Llama 3.1 via Ollama.
Key Features
- Hybrid search combining BM25 sparse and dense vector retrieval
- Streaming LLM responses powered by Llama 3.1 via Ollama
- Inspectable retrieved document chunks per query
- Persistent chat history within session
Tech Stack
PythonPineconeLangChainOllamaStreamlit