A privacy-first, fully local RAG (Retrieval-Augmented Generation) system for querying your personal notes, PDFs, and Markdown files — powered by ChromaDB, SentenceTransformers, and Ollama.
| Feature | Details |
|---|---|
| 100% local | No cloud APIs — your data never leaves your machine |
| Multi-format ingestion | PDF, Markdown (`.md`), plain text (`.txt`) |
| Hybrid search | BM25 + dense vector search fused with Reciprocal Rank Fusion (RRF) |
| Query refinement loop | Iterative LLM-powered query rewriting for better retrieval |
| Streamlit UI | Chat interface, one-off search, summarisation, and insights tabs |
| CLI | Full-featured command-line interface + REPL |
| Idempotent ingestion | Re-ingesting the same file never creates duplicate chunks |
| Fully configurable | All parameters in `.env` or environment variables |
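One way the idempotent-ingestion guarantee can be achieved is with content-addressed chunk IDs: if an ID is derived deterministically from the chunk's source and text, re-ingesting an unchanged file upserts the same IDs instead of inserting new rows. A minimal sketch of the idea (illustrative only; the project's actual scheme in `ingestion.py` may differ):

```python
# Sketch: derive a stable chunk ID from source path + chunk text, so
# repeated ingestion of identical content maps to the same vector-store key.
import hashlib

def chunk_id(source_path: str, chunk_text: str) -> str:
    digest = hashlib.sha256(f"{source_path}\x00{chunk_text}".encode("utf-8"))
    return digest.hexdigest()[:16]

# Identical input always yields the same ID, so an upsert replaces the
# existing entry rather than duplicating it.
assert chunk_id("notes.md", "hello") == chunk_id("notes.md", "hello")
```

With IDs like these, ChromaDB's upsert semantics make re-ingestion a no-op for unchanged chunks.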
```
┌──────────────────────────────────────────────────────────────────┐
│                          User Interface                          │
│         CLI (cli.py)              Streamlit UI (app.py)          │
└──────────────────────────────┬───────────────────────────────────┘
                               │
                               ▼
┌──────────────────────────────────────────────────────────────────┐
│                RAG Pipeline (src/rag_pipeline.py)                │
│                                                                  │
│  ┌───────────────┐  ┌───────────────────┐  ┌───────────────┐     │
│  │   Ingestion   │  │  Query Rewriter   │  │  LLM Client   │     │
│  │ (ingestion.py)│  │(query_rewriter.py)│  │   (llm.py)    │     │
│  └───────┬───────┘  └─────────┬─────────┘  └───────┬───────┘     │
│          │                    │                    │             │
│          ▼                    ▼                    │             │
│  ┌───────────────┐  ┌───────────────────┐          │             │
│  │  Embeddings   │  │ Hybrid Retriever  │◄─────────┘             │
│  │(embeddings.py)│  │  (retrieval.py)   │                        │
│  └───────┬───────┘  └─────────┬─────────┘                        │
│          │                    │                                  │
│          ▼                    ▼                                  │
│      ┌──────────────────────────────────┐                        │
│      │           Vector Store           │                        │
│      │         (vector_store.py)        │                        │
│      │             ChromaDB             │                        │
│      └──────────────────────────────────┘                        │
└──────────────────────────────────────────────────────────────────┘
```
```
Ingestion:
  Document (PDF/MD/TXT) → Chunk → Embed (SentenceTransformers) → Store (ChromaDB)

Query:
  Query → [Refine loop] → Embed → Hybrid Search (BM25 + Vector + RRF) → LLM → Answer
```
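The fusion step in the query path can be sketched concretely. Reciprocal Rank Fusion scores each document as the sum of `1 / (k + rank)` over every ranked list it appears in, so documents that rank well in *both* BM25 and vector search float to the top. A minimal illustration (function and variable names are hypothetical, not the project's API):

```python
# Sketch of Reciprocal Rank Fusion (RRF) over two ranked lists of doc IDs.
from collections import defaultdict

def rrf_fuse(rankings, k=60):
    """Fuse ranked lists; k is the smoothing constant (RRF_K in .env)."""
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)

bm25_hits = ["doc3", "doc1", "doc7"]    # lexical ranking
vector_hits = ["doc1", "doc5", "doc3"]  # dense ranking
print(rrf_fuse([bm25_hits, vector_hits])[:2])  # → ['doc1', 'doc3']
```

`doc1` wins because it appears near the top of both lists, even though neither retriever ranked it first everywhere; this is the behaviour that makes RRF a robust default for hybrid search.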
```bash
# Python 3.10+
python --version

# Install Ollama from https://ollama.com
ollama serve
ollama pull llama3   # or mistral, phi3, gemma2, etc.
```

```bash
git clone https://github.com/YOUR_USERNAME/rag-knowledge-base
cd rag-knowledge-base
python -m venv .venv
source .venv/bin/activate   # Windows: .venv\Scripts\activate
pip install -r requirements.txt
```

```bash
cp .env.example .env
# Edit .env to set your preferred model and paths
```

```bash
# Ingest a directory of notes
python cli.py ingest ./docs_sample

# Or a single file
python cli.py ingest ./my_notes/important.pdf
```

```bash
# CLI one-shot
python cli.py query "What are the main components of a RAG system?"

# Streaming response
python cli.py query "Explain hybrid search" --stream

# Streamlit UI
streamlit run app.py
```

```
usage: rag-kb [-h] [--db DB] [--model MODEL] {ingest,query,summarise,insights,stats,repl} ...

Commands:
  ingest      Ingest a file or directory
  query       Ask a question (with optional streaming & refinement)
  summarise   Summarise a topic from the knowledge base
  insights    Generate insights from the knowledge base
  stats       Show knowledge base statistics
  repl        Interactive REPL mode
```
```bash
# Ingest
python cli.py ingest ./my_notes --no-recursive

# Query with options
python cli.py query "What is BM25?" --mode bm25 --stream
python cli.py query "Explain RAG" --no-refine

# Summarise
python cli.py summarise "data engineering best practices"

# Insights
python cli.py insights "machine learning"

# Stats
python cli.py stats

# Interactive REPL
python cli.py repl
# In REPL: /mode hybrid|semantic|bm25, /stats, /quit
```

```bash
streamlit run app.py
```

Opens at http://localhost:8501 with four tabs:
| Tab | Feature |
|---|---|
| Chat | Streaming Q&A with source attribution |
| Search | Raw hybrid/semantic/BM25 search results |
| Insights | AI-generated insights from your notes |
| Summarise | Topic summarisation |
All settings can be configured via `.env` or environment variables:

| Variable | Default | Description |
|---|---|---|
| `CHROMA_DB_PATH` | `./chroma_db` | ChromaDB persistence directory |
| `EMBEDDING_MODEL` | `all-MiniLM-L6-v2` | SentenceTransformer model |
| `EMBEDDING_DEVICE` | `cpu` | `cpu` or `cuda` |
| `OLLAMA_MODEL` | `llama3` | Ollama model name |
| `OLLAMA_BASE_URL` | `http://localhost:11434` | Ollama server URL |
| `TOP_K_SEMANTIC` | `10` | Dense retrieval candidates |
| `TOP_K_BM25` | `10` | BM25 retrieval candidates |
| `TOP_K_FINAL` | `5` | Final chunks passed to LLM |
| `RRF_K` | `60` | RRF smoothing constant |
| `CHUNK_SIZE` | `512` | Words per chunk |
| `CHUNK_OVERLAP` | `64` | Overlap between chunks |
| `MAX_REFINEMENT_LOOPS` | `2` | Query refinement iterations |
| `REFINEMENT_SCORE_THRESHOLD` | `0.35` | Minimum relevance score to skip refinement |
```bash
pytest tests/ -v
pytest tests/ --cov=src --cov-report=term-missing
```

| Format | Extension | Notes |
|---|---|---|
| Markdown | `.md` | YAML front-matter is stripped |
| Plain text | `.txt` | UTF-8 |
| PDF | `.pdf` | Text extraction via pdfplumber |
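The front-matter stripping noted for Markdown above can be done with a few lines of string handling. A minimal sketch (illustrative; the real ingestion code may handle more edge cases):

```python
# Sketch: drop a leading YAML front-matter block ("---" ... "---") from
# a Markdown document before chunking, so metadata keys are not embedded.
def strip_front_matter(text: str) -> str:
    if text.startswith("---\n"):
        end = text.find("\n---\n", 4)  # closing delimiter after the opener
        if end != -1:
            return text[end + 5:]
    return text  # no front-matter: return unchanged

doc = "---\ntitle: Notes\ntags: [rag]\n---\n# Heading\nBody text\n"
print(strip_front_matter(doc))  # → "# Heading\nBody text\n"
```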
- Cross-encoder reranking — add a `ms-marco-MiniLM` reranker after retrieval
- HyDE — Hypothetical Document Embeddings for better open-domain QA
- Multi-modal — embed images from PDFs (CLIP embeddings)
- Conversation memory — multi-turn chat with context window management
- Document tagging — auto-tag chunks with topics for filtered search
- Evaluation harness — RAGAS-based automated quality scoring
- Web scraping ingestion — ingest URLs directly into the knowledge base
- Export — export Q&A sessions as Markdown reports
- Fork the repo
- Create a feature branch (`git checkout -b feat/cross-encoder-reranker`)
- Commit your changes (`git commit -m "feat: add cross-encoder reranker"`)
- Push and open a PR
MIT — see LICENSE for details.
- ChromaDB — vector database
- SentenceTransformers — embedding models
- Ollama — local LLM inference
- rank-bm25 — BM25 implementation
- pdfplumber — PDF text extraction
- Streamlit — web UI framework