feat: research-grade prompt-injection detection benchmark with baselines + ablation by quantifylabs · Pull Request #39 · quantifylabs/aegis-memory

quantifylabs · 2026-05-31T17:30:04Z

No description provided.

…nes + ablation

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c7cfaafe09

ℹ️ About Codex in GitHub

Your team has set up Codex to review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

Codex can also answer questions or update the PR. Try commenting "@codex address that feedback".

chatgpt-codex-connector · 2026-05-31T17:32:52Z

+        from datasets import load_dataset
+
+        resolved = _resolve_hf_revision(DEEPSET_REPO, DEEPSET_REVISION)
+        ds = load_dataset(DEEPSET_REPO, revision=DEEPSET_REVISION)


Pin Hugging Face loads to the resolved revision

This resolves main to a commit SHA and records that SHA, but the actual download still uses DEEPSET_REVISION (main). If the upstream dataset advances between resolution and download, or on later reruns, the benchmark consumes moving data while presenting the run as pinned; the same pattern appears in the Dolly loader. Use the resolved SHA for load_dataset(..., revision=resolved) so the recorded revision actually matches the rows scored.

Useful? React with 👍 / 👎.

chatgpt-codex-connector · 2026-05-31T17:32:52Z

+        resolved = _github_file_sha(INJECAGENT_REPO, INJECAGENT_FILES[0], INJECAGENT_REF)
+        cases: list[str] = []
+        for path in INJECAGENT_FILES:
+            raw_url = f"https://raw.githubusercontent.com/{INJECAGENT_REPO}/{INJECAGENT_REF}/{path}"


Pin InjecAgent raw fetches to immutable commits

The metadata records only the latest commit for the first InjecAgent file, but both JSON files are fetched from the moving main ref. If main advances or data/test_cases_ds_base.json changes independently, the benchmark can sample rows that are not represented by the recorded revision, breaking reproducibility of the published results. Resolve an immutable repository commit (or record/fetch each file SHA) and use that ref in the raw URL.

Useful? React with 👍 / 👎.

… reproducibility

feat: research-grade prompt-injection detection benchmark with baseli…

c7cfaaf

…nes + ablation

chatgpt-codex-connector Bot reviewed May 31, 2026

View reviewed changes

fix(benchmark): pin all dataset loaders to resolved revision SHAs for…

b089193

… reproducibility

quantifylabs merged commit ca426bb into main May 31, 2026
4 checks passed

quantifylabs deleted the feat/injection-benchmark branch May 31, 2026 18:03

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: research-grade prompt-injection detection benchmark with baselines + ablation#39

feat: research-grade prompt-injection detection benchmark with baselines + ablation#39
quantifylabs merged 2 commits into
mainfrom
feat/injection-benchmark

quantifylabs commented May 31, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Uh oh!

chatgpt-codex-connector Bot May 31, 2026

Uh oh!

chatgpt-codex-connector Bot May 31, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

quantifylabs commented May 31, 2026

Uh oh!

chatgpt-codex-connector Bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

chatgpt-codex-connector Bot May 31, 2026

Choose a reason for hiding this comment

Uh oh!

chatgpt-codex-connector Bot May 31, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants