feat(analyzer): implement MCP rug-pull detection (RP1-RP3) by mimran-khan · Pull Request #161 · NVIDIA/SkillSpector

mimran-khan · 2026-06-22T21:08:27Z

Summary

The mcp_rug_pull analyzer was a non-functional stub that unconditionally returned empty findings — indistinguishable from "checked and found nothing" in scan output. This gave users a false sense of security when scanning MCP-enabled skills.

This PR implements actual rug-pull detection with two analysis modes:

Manifest Comparison (when `previous_manifest` is supplied)

Rule	Detection	Severity
RP1	New parameters added to tools	HIGH
RP2	Tool/skill description changed	MEDIUM
RP3	Parameters removed from tools	MEDIUM

Static Risk Analysis (always runs)

Rule	Detection	Severity
RP1	Wildcard (`*`) permissions	HIGH
RP2	Dynamic tool loading patterns (fetch_tools, runtime discovery)	MEDIUM

Key Behavior Changes

No more silent no-ops: When previous_manifest is unavailable, the analyzer now logs a WARNING instead of silently producing "0 findings"
Actual detection: Identifies concrete rug-pull risk indicators
Actionable findings: Each finding includes specific remediation guidance

How Rug-Pull Detection Works

First scan:   manifest A → stored as baseline
Second scan:  manifest B → compared against A
                          → RP1 if new params added (data capture)
                          → RP2 if descriptions changed (prompt injection)
                          → RP3 if params removed (behavior divergence)

Static analysis runs on every scan regardless, catching:

Overly broad permissions that amplify rug-pull impact
Dynamic tool loading that makes definitions unpredictable between scans

Testing

15 new tests covering:

RP1: New parameter detection
RP2: Description change detection
RP3: Parameter removal detection
No-change produces no findings
Empty manifests handled gracefully
Wildcard permission flagged
Specific permissions pass
Dynamic loading patterns caught
Normal code not flagged
Node integration with/without previous_manifest
Warning logged when comparison unavailable

Fixes #150

Replaces the non-functional stub with a real analyzer that detects MCP rug-pull attack indicators: Manifest comparison (when previous_manifest available): - RP1: New parameters added (potential data capture) - RP2: Description changed (potential prompt injection via description) - RP3: Parameters removed (behavior divergence) Static risk analysis (always runs): - RP1: Wildcard permissions that amplify rug-pull impact - RP2: Dynamic tool loading patterns (fetch_tools, runtime discovery) Key behavior changes: - Logs WARNING when previous_manifest unavailable (was silently returning "0 findings" indistinguishable from "checked and found nothing") - Actually detects rug-pull indicators instead of being a no-op - Returns actionable findings with remediations Fixes NVIDIA#150

rng1995

Verdict: Request changes — the analyzer is a real improvement over the stub and the tests are good, but there's a likely duplicate PR, inconsistent rule-ID semantics, and the headline manifest-diff path is unreachable.

Blocking / please clarify

Possible duplicate of the still-open #125 (identical title, "implement MCP rug-pull detection (RP1-RP3)"). Only one should proceed — please confirm which supersedes so we don't merge both.
Inconsistent rule-ID semantics. In _compare_manifests (src/skillspector/nodes/analyzers/mcp_rug_pull.py): RP1 = new parameter (~L63), RP3 = removed parameter (~L92), RP2 = description change (~L121). But in _static_risk_analysis: RP1 = wildcard permission (~L168) and RP2 = dynamic loading (~L202). The same rule_id carries two different meanings, which produces inconsistent SARIF rule descriptors — note the rules[] dedup added in #158 keeps only the first message seen per id, so one of the two meanings will be silently mislabeled. Give each distinct concept its own rule id.
Manifest-comparison path is unreachable. node() reads state.get("previous_manifest") (~L240) but nothing in this PR populates it — there's no --previous-manifest CLI flag or state plumbing. So in practice previous_manifest is always None, only _static_risk_analysis runs, and the headline RP1/RP2/RP3 manifest-diffing never fires (confirmed by test_without_previous_manifest_warns). Please add the plumbing or clarify the follow-up that wires it.

Tests

Unit tests for both _compare_manifests and _static_risk_analysis are good. They don't catch (2)/(3) because they call the helpers directly / inject previous_manifest into state by hand, bypassing the missing CLI/state wiring.

mimran-khan · 2026-06-23T10:24:31Z

Thanks for the review @rng1995 — all three blocking points are valid.

After checking, PR #125 by @tcconnally covers the same scope with cleaner rule-ID semantics (each RP1/RP2/RP3 has a single consistent meaning) and was opened first. Our manifest-comparison path here is unreachable without CLI plumbing that doesn't exist yet.

Closing this in favor of #125. If the manifest-diffing logic is useful down the road, it can be proposed as a follow-up once #125 lands and the state/CLI plumbing is in place.

mimran-khan force-pushed the feat/mcp-rug-pull-detection branch from 4c9130a to 88b912f Compare June 23, 2026 09:47

rng1995 requested changes Jun 23, 2026

View reviewed changes

mimran-khan closed this Jun 23, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

feat(analyzer): implement MCP rug-pull detection (RP1-RP3)#161

feat(analyzer): implement MCP rug-pull detection (RP1-RP3)#161
mimran-khan wants to merge 1 commit into
NVIDIA:mainfrom
mimran-khan:feat/mcp-rug-pull-detection

mimran-khan commented Jun 22, 2026

Uh oh!

rng1995 left a comment

Uh oh!

mimran-khan commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

mimran-khan commented Jun 22, 2026

Summary

Manifest Comparison (when previous_manifest is supplied)

Static Risk Analysis (always runs)

Key Behavior Changes

How Rug-Pull Detection Works

Testing

Uh oh!

rng1995 left a comment

Choose a reason for hiding this comment

Blocking / please clarify

Tests

Uh oh!

mimran-khan commented Jun 23, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Manifest Comparison (when `previous_manifest` is supplied)