RAGentX 🤖

Autonomous Self-Correcting Multi-Agent RAG Orchestrator

RAGentX is an enterprise-grade Agentic Retrieval-Augmented Generation (RAG) system designed to eliminate the common pitfalls of traditional RAG—such as irrelevant context retrieval and hallucinations. Built with LangGraph, it coordinates a team of specialized AI agents that collaboratively retrieve, evaluate, and self-correct their reasoning path in real-time.

🌟 Project Motivation

Standard RAG systems often suffer from "one-shot" failures: if the first retrieval is poor, the final answer is incorrect. RAGentX solves this by treating RAG as an iterative, agentic process. It introduces a Reflexion Loop where the system critiques its own retrieved context and autonomously decides whether to re-search, re-write the query, or proceed to generation.

✨ Key Features

🧠 Intelligent Orchestration (DAG)

RAGentX is governed by a stateful Directed Acyclic Graph (DAG) that manages complex state transitions between agent nodes.

graph TD
    %% Entry Point
    START((START)) --> Rewriter[Query Rewriter]

    %% Main Pipeline
    subgraph "Autonomous Agentic Pipeline"
        Rewriter --> Retriever[Hybrid Retriever]
        Retriever --> Evaluator[Context Evaluator]
    end

    %% Self-Correction Logic
    Evaluator -->|Relevant| Generator[Response Generator]
    Evaluator -->|"Irrelevant (Retry < 3)"| Rewriter
    Evaluator -->|Fallback| Generator

    %% Validation Gate
    Generator --> Validator[Output Validator]
    Validator -->|Valid| END((END))
    Validator -->|Invalid| Generator

    %% Visual Styles
    style START fill:#0f172a,stroke:#38bdf8,color:#fff
    style END fill:#0f172a,stroke:#38bdf8,color:#fff
    classDef agentNode fill:#1e293b,stroke:#38bdf8,stroke-width:2px,color:#fff
    class Rewriter,Retriever,Evaluator,Generator,Validator agentNode

Self-Healing Retrieval: Implements an autonomous "Test-and-Repair" loop. If the Evaluator finds the retrieved documents insufficient, the system re-attempts retrieval with a refined query.
Hybrid Ensemble Search: Combines FAISS (dense vector similarity) with BM25 (sparse keyword matching) to maximize both semantic recall and term-based precision.
Cost-Optimized Architecture: Runs local HuggingFace embeddings (all-MiniLM-L6-v2), removing expensive API dependencies for vectorization.
High-Fidelity Generation: Leverages Groq's Llama 3.3-70b-versatile for near-instant reasoning and response generation.

🛠️ Developer Experience (DX)

Unified Entry Point: Launch the complete stack (FastAPI + Streamlit) with a single command: uv run python init_and_run.py.
Transparent Traceability: A built-in "Debug Mode" in the UI allows you to inspect the agent's internal reasoning, rewritten queries, and relevance scores.
Production-Ready Persistence: Uses SQLAlchemy and SQLite to manage session-based chat history and document metadata.

🚀 Quick Start

1. Prerequisites

Python 3.10+
uv (Fastest Python package manager)
Groq API Key

2. Initialization & Setup

Run the automated setup script to sync dependencies and scaffold your environment:

python setup.py

Or manually:

uv sync

3. Configuration

Create a .env file in the root directory. You can use the provided .env.example as a template:

cp .env.example .env

Ensure your .env file has the following keys configured:

GROQ_API_KEY=your_gsk_key_here
GROQ_MODEL_NAME=llama-3.3-70b-versatile
DATABASE_URL=sqlite:///./ragentx.db
FAISS_INDEX_PATH=vectorstore/faiss_index

4. Run Everything (Recommended)

uv run python init_and_run.py

5. Individual Execution (Manual)

If you prefer to run the services separately for development, use two terminals:

Terminal 1: Backend (FastAPI)

uv run python -m uvicorn app.main:app --host 0.0.0.0 --port 8000 --reload

Terminal 2: Frontend (Streamlit)

uv run streamlit run frontend/app.py

📂 Project Structure

app/
├── api/             # FastAPI REST endpoints (/chat, /ingest)
├── agents/          # LangGraph nodes and state orchestration
├── retrieval/       # Hybrid search and vectorstore logic
├── services/        # Centralized LLM factory
├── database/        # SQLite models and session management
├── core/            # Configuration and logging
frontend/            # Streamlit interactive dashboard
sample_data/         # Complex markdown documents for showcase testing

📋 Technical Stack

AI Framework: LangChain & LangGraph
LLM Engine: Groq (Llama 3.3-70b-versatile)
Vector Store: FAISS
Retrieval: Hybrid (Dense + Sparse Ensemble)
Embeddings: HuggingFace (Local)
API: FastAPI
Frontend: Streamlit
Package Manager: uv

📸 Screenshots

1. Document Management and Ingestion

Upload and manage your PDF, TXT, and Markdown files with ease. The system automatically chunks and indexes them for retrieval.

2. Multi-Agent RAG Chat Interface

Ask questions about your documents. The agentic loop retrieves context, evaluates relevance, and generates precise answers with source attribution.

3. Agentic Debug Trace

Peek under the hood to see exactly how the agent rewrote your query, evaluated context, and formulated the final answer.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
app		app
assets		assets
data/uploads		data/uploads
docs/superpowers		docs/superpowers
frontend		frontend
sample_data		sample_data
scripts		scripts
tests		tests
.env.example		.env.example
.gitignore		.gitignore
GEMINI.md		GEMINI.md
LICENSE		LICENSE
README.md		README.md
app.log		app.log
data.txt		data.txt
init_and_run.py		init_and_run.py
large.txt		large.txt
main.py		main.py
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.py		setup.py
test_error.txt		test_error.txt
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RAGentX 🤖

🌟 Project Motivation

✨ Key Features

🧠 Intelligent Orchestration (DAG)

🛠️ Developer Experience (DX)

🚀 Quick Start

1. Prerequisites

2. Initialization & Setup

3. Configuration

4. Run Everything (Recommended)

5. Individual Execution (Manual)

📂 Project Structure

📋 Technical Stack

📸 Screenshots

1. Document Management and Ingestion

2. Multi-Agent RAG Chat Interface

3. Agentic Debug Trace

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

RAGentX 🤖

🌟 Project Motivation

✨ Key Features

🧠 Intelligent Orchestration (DAG)

🛠️ Developer Experience (DX)

🚀 Quick Start

1. Prerequisites

2. Initialization & Setup

3. Configuration

4. Run Everything (Recommended)

5. Individual Execution (Manual)

📂 Project Structure

📋 Technical Stack

📸 Screenshots

1. Document Management and Ingestion

2. Multi-Agent RAG Chat Interface

3. Agentic Debug Trace

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages