This repository accompanies the paper. It studies overflow in soft compression for RAG, specifically in the xRAG setting: the failure mode where a compressed `<xRAG>` token loses enough information to cause a wrong answer. We evaluate overflow detection stage-wise across the compression pipeline, which supports pre-LLM gating decisions (re-retrieve, re-chunk, or fall back to full context) and gives a consistent way to compare compression modules by checking where overflow first becomes detectable.
We study token overflow in projector-based soft compression for RAG: Retriever Embeddings → Compression Projector → LLM Generator.
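To make the pipeline concrete, here is a minimal sketch of a projector-based compressor in the spirit described above. The class and dimensions are illustrative assumptions, not the repo's actual implementation: a small MLP maps one retriever embedding to a single soft token in the LLM's hidden space.

```python
import torch
import torch.nn as nn

class CompressionProjector(nn.Module):
    """Hypothetical sketch: retriever embedding -> one soft <xRAG> token."""

    def __init__(self, retriever_dim: int = 4096, llm_dim: int = 4096):
        super().__init__()
        self.mlp = nn.Sequential(
            nn.Linear(retriever_dim, llm_dim),
            nn.GELU(),
            nn.Linear(llm_dim, llm_dim),
        )

    def forward(self, passage_embed: torch.Tensor) -> torch.Tensor:
        # (batch, retriever_dim) -> (batch, 1, llm_dim): one token per passage
        return self.mlp(passage_embed).unsqueeze(1)

proj = CompressionProjector()
soft_token = proj(torch.randn(2, 4096))
print(soft_token.shape)  # torch.Size([2, 1, 4096])
```

The soft token is then spliced into the LLM's input embeddings in place of the full passage text, which is where information can be lost.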
Overflow is compression-induced task failure: the model answers correctly from the full context but fails once the context is compressed.

Context → Compression → ⚠️ Overflow ⚠️ → Wrong answer
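This definition can be stated as a label rule (a hedged sketch; the argument names are illustrative): overflow fires only when the full-context answer is right and the compressed answer is wrong, so failures the model would make anyway are not counted against compression.

```python
def overflow_label(full_context_correct: bool, compressed_correct: bool) -> int:
    """1 iff compression caused the failure; 0 otherwise."""
    return int(full_context_correct and not compressed_correct)

assert overflow_label(True, False) == 1   # compression lost the answer -> overflow
assert overflow_label(True, True) == 0    # compression preserved enough
assert overflow_label(False, False) == 0  # failure not caused by compression
```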
You need two pre-trained models:
| Model | Role | Where to get it? |
|---|---|---|
| LLM with Projector | Generates baseline and xRAG answers and provides hidden states | xRAG repo |
| Retriever | Encodes background passages into xRAG embeddings | Salesforce/SFR-Embedding-Mistral |
```bash
pip install .
```

This installs dependencies for the full pipeline under `scripts/`. The `scripts/xRAG/` submodule has its own setup; see its upstream docs.
The main probing pipeline has four stages. Each stage writes output files that the next stage reads.
```bash
cd scripts/data_preprocessing
python build_samples.py \
    --data triviaqa \
    --split validation \
    --out_jsonl runs/triviaqa/samples.jsonl \
    --max_samples 100
```

Output: `runs/triviaqa/samples.jsonl`, records with `id`, `question`, `answer`, `background`, and `task_type`.
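A quick sanity check for the samples file can be sketched as follows (the expected keys are taken from the output description above; the loader itself is illustrative):

```python
import json

def load_samples(path: str) -> list[dict]:
    """Read one JSON record per line from a .jsonl file."""
    with open(path) as f:
        return [json.loads(line) for line in f]

# Keys each record should carry, per the pipeline's output format:
EXPECTED_KEYS = {"id", "question", "answer", "background", "task_type"}
```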
```bash
python run_pipeline.py \
    --samples_jsonl runs/triviaqa/samples.jsonl \
    --out_dir runs/triviaqa \
    --mode both \
    --model_name_or_path /path/to/xrag-7b \
    --retriever_name_or_path /path/to/xrag_embed \
    --device cuda:0
```

Outputs:

- `runs/triviaqa/results.jsonl`: per-sample predictions + `overflow_label`
- `runs/triviaqa/embeds/background_embeds.pt`: cached background embeddings
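Given `results.jsonl`, the per-sample `overflow_label` can be aggregated into a single overflow rate. This is a hedged sketch assuming one JSON object per line with an integer `overflow_label` field, as described above:

```python
import json

def overflow_rate(results_path: str) -> float:
    """Fraction of samples where compression caused the failure."""
    labels = []
    with open(results_path) as f:
        for line in f:
            labels.append(json.loads(line)["overflow_label"])
    return sum(labels) / max(len(labels), 1)
```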
```bash
python probe_pipeline.py \
    --samples_jsonl runs/triviaqa/results.jsonl \
    --ctx2embed runs/triviaqa/embeds/background_embeds.pt \
    --out_dir runs/triviaqa/probe_m16 \
    --model_name_or_path /path/to/xrag-7b \
    --device cuda:0 \
    --mid_layer_index 16 \
    --project_questions \
    --save_vectors_pt
```

Outputs:

- `runs/triviaqa/probe_m16/vectors.pt`: tensors keyed by stage (`preproj`, `postproj`, `mid`, `last`, `mid_q`, `last_q`) + `labels`
- `runs/triviaqa/probe_m16/features.jsonl`: scalar saturation metrics per sample
Alternative: `scripts/feature_collection/run_feature_extraction.ipynb` runs Steps 2–3 in a single notebook pass and also adds context-level features (perplexity, length, gzip) to `features.jsonl`.
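Consuming `vectors.pt` might look like the sketch below. The stage keys come from the output description above, but the exact tensor layout inside the saved dict is an assumption:

```python
import torch

STAGES = ("preproj", "postproj", "mid", "last", "mid_q", "last_q")

def stage_features(vectors: dict, stage: str) -> tuple[torch.Tensor, torch.Tensor]:
    """Pull one stage's feature matrix and the overflow labels (assumed layout)."""
    assert stage in STAGES, f"unknown stage: {stage}"
    X = vectors[stage].float()    # assumed shape: (n_samples, hidden_dim)
    y = vectors["labels"].long()  # assumed shape: (n_samples,)
    return X, y
```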
```bash
cd ../probing_experiments
python run_probing_experiments.py \
    --data_path ../data_preprocessing/runs/triviaqa/probe_m16/vectors.pt \
    --output_dir results/triviaqa_probe_m16 \
    --cv_folds 5 \
    --device 0
```

This trains lightweight classifiers on all embedding-stage combinations with stratified 5-fold CV and writes JSON result files with AUC-ROC, F1, and average precision.
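The probing setup can be sketched as below. This is a minimal stand-in, not the repo's actual classifier or hyperparameters: a linear probe on one stage's embeddings, scored with stratified 5-fold cross-validated AUC-ROC.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import StratifiedKFold

def probe_auc(X: np.ndarray, y: np.ndarray, folds: int = 5) -> float:
    """Mean held-out AUC-ROC of a linear probe under stratified k-fold CV."""
    skf = StratifiedKFold(n_splits=folds, shuffle=True, random_state=0)
    aucs = []
    for train_idx, test_idx in skf.split(X, y):
        clf = LogisticRegression(max_iter=1000).fit(X[train_idx], y[train_idx])
        scores = clf.predict_proba(X[test_idx])[:, 1]
        aucs.append(roc_auc_score(y[test_idx], scores))
    return float(np.mean(aucs))
```

Stratification matters here because overflow labels are typically imbalanced, and AUC-ROC is threshold-free, which makes probes on different stages directly comparable.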
Julia Belikova, Danila Rozhevskii, Dennis Svirin, Konstantin Polev, and Alexander Panchenko (2026): Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 4: Student Research Workshop), pages 797–810, Rabat, Morocco. Association for Computational Linguistics.
```bibtex
@inproceedings{belikova-etal-2026-detecting,
    title = "Detecting Overflow in Compressed Token Representations for Retrieval-Augmented Generation",
    author = "Belikova, Julia and Rozhevskii, Danila and Svirin, Dennis and Polev, Konstantin and Panchenko, Alexander",
    editor = "Baez Santamaria, Selene and Somayajula, Sai Ashish and Yamaguchi, Atsuki",
    booktitle = "Proceedings of the 19th Conference of the {E}uropean Chapter of the {A}ssociation for {C}omputational {L}inguistics (Volume 4: Student Research Workshop)",
    month = mar,
    year = "2026",
    address = "Rabat, Morocco",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2026.eacl-srw.59/",
    pages = "797--810",
    ISBN = "979-8-89176-383-8"
}
```