Self-RAG: Custom Implementation

This is my custom implementation of
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection.

Overview

This project is a personal re-implementation of the Self-RAG architecture —
a system that learns to retrieve, generate, and self-critique its outputs through iterative reflection.

It integrates multiple modern tools to reproduce the core ideas of the paper using practical components.

Components Used

Ollama — for local LLM inference and prompt execution
Vespa — as the vector database for document embeddings and retrieval
LangGraph — for orchestrating the data flow and logic graph between modules

How It Works

Text Chunking & Embedding:
Input documents are split into semantically coherent chunks.
Each chunk is embedded into a dense vector representation.
Vector Storage & Retrieval:
These embeddings are stored in Vespa, which enables fast vector similarity search for relevant chunks.
LLM Reasoning via Ollama:
Queries are sent to an Ollama-served LLM.
The retrieved context from Vespa is combined with the user’s input to generate a first draft answer.
Self-Reflection with LangGraph:
Using LangGraph, the system organizes the reasoning flow —
feeding generated responses back into the LLM for critique and refinement, inspired by Self-RAG’s reflection mechanism.

Technologies

Component	Purpose
Ollama	Local LLM interface
Vespa	Vector database for embeddings
LangGraph	Graph-based orchestration
Docker	Containerized setup for reproducibility

Reference

Shinn, N., Cassano, F., Kiela, D., & Weston, J. (2023).
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection.
arXiv:2310.11511

Notes

This implementation is experimental and simplified compared to the original Self-RAG paper.
It focuses on practical integration of retrieval, generation, and reflection using available open-source components.

License

This project is released for educational and research purposes.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
app		app
data		data
data_processing		data_processing
vespa_app		vespa_app
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Self-RAG: Custom Implementation

Overview

Components Used

How It Works

Technologies

Reference

Notes

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Self-RAG: Custom Implementation

Overview

Components Used

How It Works

Technologies

Reference

Notes

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages