Open-source prompt injection protection for agents calling tools (via MCP, CLI, or direct function calling). Detect and defend against prompt injection attacks. 22 MB, CPU-only, < 10 ms latency.
System-level security for LLM agents: fine-grained policy enforcement on tool calls to defend against indirect prompt injection
Transform any content into 9 platform-native formats or convert between content types — with optional brand voice matching. Supports Twitter/X, LinkedIn, newsletter, Instagram, YouTube Shorts, TikTok, Threads, Bluesky, and podcast. Secure-by-default: includes prompt injection defenses for safe URL and web content processing.
Reproducible security benchmarks for the Deconvolute SDK, measuring AI system integrity against adversarial attacks.
Signed provenance labels and taint-tracking policy for LLM agent security. The core library behind AgentMesh.
Prompt-injection defenses for Claude Code. A PreToolUse Bash hook blocks compositional credential-exfiltration shapes (secret read plus network, env dump to network, remote script to shell, reverse shells). A sanitizing MCP server wraps untrusted URLs and files in sentinels, strips invisible unicode, flags jailbreaks.
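A hook of the kind described above might be sketched as follows. This is an illustrative, hypothetical script, not code from the repository: it assumes Claude Code passes the pending tool call as JSON on stdin and that exiting with a non-zero status from the registered command blocks the call. The regexes here are deliberately minimal examples of "secret read plus network" shapes.

```python
import json
import re
import sys

# Hypothetical patterns for one compositional shape: reading secrets
# combined with network egress in the same Bash command.
SECRET_READ = re.compile(r"\.env\b|\.aws/credentials|id_rsa|\bprintenv\b")
NETWORK = re.compile(r"\bcurl\b|\bwget\b|\bnc\b|/dev/tcp/")

def is_exfil_shape(command: str) -> bool:
    """True if the command composes a secret read with network egress."""
    return bool(SECRET_READ.search(command)) and bool(NETWORK.search(command))

def main() -> int:
    # Invoked by the agent runtime with the tool-call event on stdin.
    event = json.load(sys.stdin)
    cmd = event.get("tool_input", {}).get("command", "")
    if is_exfil_shape(cmd):
        print("Blocked: secret read combined with network egress", file=sys.stderr)
        return 2  # non-zero exit signals the runtime to block the call
    return 0
```

Note the key design point the description implies: neither pattern alone is blocked (reading `.env` locally or fetching a URL can be legitimate); only their composition in one command is.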
Automatically generate YARA rules from adversarial and benign text samples. Built for detecting indirect prompt injection attacks on RAG pipelines.
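The core idea of rule generation from labeled samples can be sketched in a few lines. This is a naive illustration under assumed semantics (the function name and heuristic are invented, not the repository's API): take long-ish phrases that appear in adversarial samples but never in the benign corpus, and emit them as a YARA rule.

```python
import re

def generate_yara_rule(adversarial, benign, name="auto_injection", min_len=12):
    """Emit a YARA rule from phrases seen only in adversarial text.

    Naive sketch: candidates are lowercase phrases of at least min_len
    characters that occur in adversarial samples and never in benign ones.
    """
    benign_blob = " ".join(s.lower() for s in benign)
    candidates = []
    for sample in adversarial:
        for phrase in re.findall(r"[a-z][a-z ]{%d,}" % (min_len - 1), sample.lower()):
            phrase = phrase.strip()
            if len(phrase) >= min_len and phrase not in benign_blob:
                candidates.append(phrase)
    strings = "\n".join(
        f'        $s{i} = "{p}" nocase'
        for i, p in enumerate(dict.fromkeys(candidates))
    )
    return (
        f"rule {name}\n{{\n    strings:\n{strings}\n"
        f"    condition:\n        any of them\n}}"
    )
```

A real generator would weigh phrase frequency and false-positive rates rather than requiring zero benign hits, but the adversarial-vs-benign contrast is the essential mechanism.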
AgentForensics is an open-source security framework that monitors complete LLM agent sessions in real time, detecting prompt injection attacks across tool outputs, web pages, documents, and API responses. It combines heuristic rules, a DistilBERT ML classifier, instruction-boundary detection, semantic-drift analysis, and sliding-window multi-turn detection.
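Sliding-window multi-turn detection matters because an injection can be split across messages so that no single turn trips a rule. A minimal sketch of the idea, with invented class and pattern names (not the project's API):

```python
import re
from collections import deque

# Toy heuristic rules; a real system would carry far more patterns
# plus a learned classifier.
INJECTION_PATTERNS = [
    re.compile(r"ignore (all |any )?(previous|prior) instructions", re.I),
    re.compile(r"you are now", re.I),
    re.compile(r"reveal the system prompt", re.I),
]

class SlidingWindowDetector:
    """Scans the concatenation of the last N turns, not each turn alone."""

    def __init__(self, window_size: int = 3):
        self.turns = deque(maxlen=window_size)

    def observe(self, text: str) -> bool:
        """Add a turn; return True if the current window looks injected."""
        self.turns.append(text)
        window = " ".join(self.turns)
        return any(p.search(window) for p in INJECTION_PATTERNS)
```

Scanning per turn would miss "ignore all previous" in one message and "instructions..." in the next; joining the window recovers the phrase.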
Research artifact, paper, and frozen evaluation outputs for selective revocation and replay after persistent indirect prompt injection in memory-augmented LLM agents.
Agentic AI Security Research
Buzur is an open-source 25-phase scanner that protects AI agents and LLM applications from indirect prompt injection attacks (OWASP LLM Top 10 #1).