Sprint/sprint 16 debug intelligence upgrade by edgarbnt · Pull Request #36 · edgarbnt/ReactSentinel

edgarbnt · 2026-05-13T22:23:26Z

This pull request introduces a new "verdict-first" diagnostics system for React runtime investigations, focusing on actionable, high-level diagnoses before presenting raw data. It adds new diagnostic capabilities and supporting types, and implements core logic for three new high-signal diagnostic tools: excess render detection, memo break analysis, and runtime bug triage.

New Diagnostic Capabilities and APIs:

Added new high-level diagnostic tools to the capabilityCatalog, including diagnose_excess_renders, find_memo_breaks, diagnose_runtime_bug, find_race_conditions, verify_hypothesis, and verify_fix. These tools are grouped under a new investigation_tools capability for streamlined, verdict-first investigations. (src/capabilities.ts) [1] [2] [3] [4] [5]

Type System and Protocol Enhancements:

Introduced the DiagnosticVerdict interface and supporting types (e.g., DiagnosticConfidence) to standardize verdict-first diagnostic responses, encapsulating a machine-readable verdict, summary, evidence, confidence, and raw data. (src/diagnostics/protocol.ts)

Core Diagnostic Logic Implementations:

Implemented three new diagnostic functions in src/diagnostics/investigation.ts:
- createExcessRenderDiagnosis: Determines the cause of excess renders, distinguishing between render loops, memo breaks, context cascades, and hook instability.
- createMemoBreakDiagnosis: Analyzes render hotspots to detect memo breaks versus context/provider churn or internal state issues.
- createRuntimeBugDiagnosis: Provides a verdict-first triage for vague runtime bugs, identifying hydration failures, race conditions, render instability, or generic runtime errors.
  Each function returns a standardized DiagnosticVerdict with actionable next steps and preserves the raw diagnostic payload. (src/diagnostics/investigation.ts)

Tool Integration and Catalog Updates:

Updated the shadow_sandbox capability to include the new verify_fix tool, supporting validation of runtime fixes and regression detection in sandbox mode. (src/capabilities.ts)

These changes lay the groundwork for a more actionable and user-friendly diagnostics experience, making it easier to identify and address common React runtime issues.## Summary

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

Copilot

Pull request overview

This PR introduces a "verdict-first" diagnostics layer for React-Sentinel's MCP tools: high-level investigation tools (diagnose_excess_renders, find_memo_breaks, diagnose_runtime_bug, attribute_render), runtime-evidence-based verification tools (verify_hypothesis, verify_fix and aliases), an adversarial-timing race-finder, a new managed-Chromium browser mode, and updated tool descriptions/capability catalog/docs that steer agents toward runtime tools over grep.

Changes:

New diagnostic protocol (DiagnosticVerdict) plus verdict wrappers around existing render/async/race/hydration tools and new investigation.ts orchestrators.
New tools: start_debug_replay, validate_user_flow, find_race_conditions, patch_and_validate, verify_hypothesis/test_runtime_hypothesis, verify_fix/verify_runtime_fix, plus diagnose/find/attribute investigation tools.
New managed-Chromium browser mode (--browser-mode) with doctor reporting, expanded render-monitor cause attribution (prop_diff, context_change, provider_value_recreated, parent_render, …), capability/docs updates and tool-selection guide.

Reviewed changes

Copilot reviewed 18 out of 18 changed files in this pull request and generated 3 comments.

Show a summary per file

File	Description
src/capabilities.ts	Adds tool-selection guide, new investigation/verify capabilities and alias tools.
src/tools/diagnostics.ts	Wraps existing tools with verdict creators; adds `diagnose_excess_renders`, `find_memo_breaks`, `attribute_render`, `diagnose_runtime_bug`.
src/tools/patch.ts	Adds `patch_and_validate`, `verify_hypothesis`/alias, `verify_fix`/alias with shared verdict builders.
src/tools/interaction.ts	Adds `validate_user_flow` and `find_race_conditions` with adversarial timing + minimization.
src/tools/network.ts	Updates `get_network_events` description for verdict-first guidance.
src/tools/browser.ts	Adds `start_debug_replay` alias and refreshed descriptions.
src/diagnostics/protocol.ts	Adds `DiagnosticVerdict`/`DiagnosticConfidence` and renames render-cause types.
src/diagnostics/verdict.ts	New verdict wrappers for hotspots/async/race/hydration.
src/diagnostics/investigation.ts	New high-level diagnosis orchestrators.
src/diagnostics/render-monitor.ts	Captures parent/context info and reclassifies probable causes.
src/diagnostics/react-runtime.ts	Refactors component name resolution via shared `getTypeDisplayName`.
src/browser/protocol.ts	Adds `managed` session info and `BrowserModePreference`.
src/browser/index.ts	Implements managed Chromium launch/teardown and routing.
src/index.ts	Wires `--browser-mode` CLI flag and doctor reporting.
scripts/e2e-smoke.ts	Adds `readVerdictRawData` helper and loosens hotspot/async/race/hydration assertions.
docs/*	New `tool-selection-guide.md`, `browser-modes.md` and updated checklist.

+    const infiniteLoopHotspot = renderHotspots.hotspots.find((entry) => entry.componentName === "InfiniteLoopScenario");
+    assert(renderHotspots.hotspots.length >= 1, "get_render_hotspots returned no hotspots.");
    assert(
-      renderHotspots.hotspots.some(
-        (entry) =>
-          entry.componentName === "InfiniteLoopScenario" &&
-          ["unstable_state", "unstable_hook_value", "unstable_props", "repeated_effect"].includes(
-            entry.probableCause.type
-          )
-      ),
-      "get_render_hotspots did not flag InfiniteLoopScenario with a probable cause."
+      infiniteLoopHotspot
+        ? infiniteLoopHotspot.probableCause.summary.trim().length > 0
+        : renderHotspots.hotspots.some((entry) => entry.probableCause.summary.trim().length > 0),
+      "get_render_hotspots did not return a readable probable cause."
    );


+}): DiagnosticVerdict<VerificationVerdict, FixVerificationRawData> {
+  const baselineFailures = countAssertionFailures(seed.baseline.assertions);
+  const patchedFailures = countAssertionFailures(seed.patched.report.assertions);
+  const regressionFailureCount = seed.regressionAssertions.length === 0
+    ? 0
+    : seed.patched.report.assertions
+        .slice(-seed.regressionAssertions.length)
+        .filter((result) => !result.pass).length;
+  const verdict: VerificationVerdict =
+    baselineFailures > 0 && patchedFailures === 0 && regressionFailureCount === 0
+      ? "CONFIRMED"
+      : patchedFailures >= baselineFailures
+        ? "REFUTED"
+        : "PARTIAL";
+


+  server.tool(
+    "verify_runtime_fix",
+    [
+      "Action-oriented alias for verify_fix that checks whether a candidate runtime fix actually resolves the bug.",
+      "Prefer this when the agent is phrasing the task as 'verify the fix before touching source'.",
+    ].join(" "),
+    {
+      fixDescription: z.string().min(3).max(500).describe("Short description of the fix that the runtime patch is supposed to validate."),
+      patch: runtimePatchSchema.describe("Runtime patch payload for the replay sandbox."),
+      url: z.string().url().optional().describe("Optional URL to open in the replay browser before the scenario runs."),
+      steps: z.array(replayStepSchema).min(1).describe("Ordered replay steps to execute before assertions."),
+      assertions: z.array(assertionSchema).min(1).describe("Assertions that should pass after the fix is applied."),
+      regressionAssertions: z.array(assertionSchema).optional().default([]).describe("Optional guard assertions that should remain true before and after the patch."),
+      headless: z.boolean().optional().describe("Override the replay browser mode for this verification."),
+      waitUntil: replayWaitUntilSchema.describe("Navigation readiness event when url is provided."),
+      timeoutMs: z.number().int().min(1).max(120_000).optional().default(10_000).describe("Navigation timeout in milliseconds when url is provided."),
+      continueOnError: z.boolean().optional().default(false).describe("Keep executing later steps after a step failure."),
+      waitMs: z.number().int().min(0).max(60_000).optional().default(500).describe("Wait time in milliseconds before running assertions."),
+      cleanup: z.enum(["keep", "reload", "reset_session"]).optional().default("reset_session").describe("How to clean the replay sandbox after patch verification."),
+      reopenUrl: z.string().url().optional().describe("Optional clean URL to reopen after cleanup when using reload or reset_session."),
+    },
+    async ({ fixDescription, patch, url, steps, assertions, regressionAssertions, headless, waitUntil, timeoutMs, continueOnError, waitMs, cleanup, reopenUrl }): Promise<ToolResponse> => {
+      try {
+        const combinedAssertions = [...(assertions as Assertion[]), ...(regressionAssertions as Assertion[])];
+        const baseline = await browserManager.runValidationScenario(steps, combinedAssertions, {
+          url,
+          headless,
+          waitUntil,
+          timeoutMs,
+          resetSession: true,
+          continueOnError,
+          waitMs,
+        });
+        if ("error" in baseline) return err(baseline.error);
+
+        const applyResult = await browserManager.applyRuntimePatch(patch as RuntimePatch, {
+          url,
+          headless,
+          waitUntil,
+          timeoutMs,
+          resetSession: true,
+        });
+        if ("error" in applyResult) return err(applyResult.error);
+
+        const patchedReport = await browserManager.runValidationScenario(steps, combinedAssertions, {
+          headless,
+          continueOnError,
+          waitMs,
+        });
+        if ("error" in patchedReport) {
+          if (cleanup !== "keep") {
+            const cleanupResult = await browserManager.resetRuntimePatches({
+              strategy: cleanup as "reload" | "reset_session",
+              waitUntil,
+              timeoutMs,
+              headless,
+              reopenUrl,
+            });
+            if ("error" in cleanupResult) {
+              return err(`${patchedReport.error} Cleanup after patch verification also failed: ${cleanupResult.error}.`);
+            }
+          }
+          return err(patchedReport.error);
+        }
+
+        const patched: PatchedValidationScenarioResponse = {
+          verdict: patchedReport.success ? "patch_validated" : "patch_failed",
+          apply: applyResult,
+          report: patchedReport,
+        };
+
+        if (cleanup !== "keep") {
+          const cleanupResult = await browserManager.resetRuntimePatches({
+            strategy: cleanup as "reload" | "reset_session",
+            waitUntil,
+            timeoutMs,
+            headless,
+            reopenUrl,
+          });
+          if ("error" in cleanupResult) return err(cleanupResult.error);
+          patched.cleanup = cleanupResult;
+        }
+
+        const response = createFixVerdict({
+          fixDescription,
+          baseline,
+          patched,
+          regressionAssertions: regressionAssertions as Assertion[],
+        });
+
+        return ok({
+          ...response,
+          reportMarkdown: buildFixVerificationMarkdown(fixDescription, response.verdict, baseline, patched, regressionAssertions as Assertion[]),
+        });
+      } catch (error) {
+        return err(`verify_runtime_fix failed unexpectedly: ${String(error)}`);
+      }
+    }
+  );


edgarbnt · 2026-05-13T22:29:24Z

@copilot apply changes based on the comments in this thread

…edup Agent-Logs-Url: https://github.com/edgarbnt/ReactSentinel/sessions/6bcec9f4-64c6-453f-a101-1e067fcf3d78 Co-authored-by: edgarbnt <146716791+edgarbnt@users.noreply.github.com>

Agent-Logs-Url: https://github.com/edgarbnt/ReactSentinel/sessions/6bcec9f4-64c6-453f-a101-1e067fcf3d78 Co-authored-by: edgarbnt <146716791+edgarbnt@users.noreply.github.com>

edgarbnt and others added 7 commits May 13, 2026 23:42

feat: [SCRUM-449] add shared diagnostic verdict format

7770010

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-437] add high-level runtime investigations

eeb7b91

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-438] add deterministic replay stress testing

4a042aa

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-436] add hypothesis and fix verification tools

b31daca

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-440] add render attribution diagnostics

db49d70

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-442] improve MCP tool selection UX

d2f707a

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

feat: [SCRUM-441] add managed browser mode

d18f1cd

Co-authored-by: Copilot <223556219+Copilot@users.noreply.github.com>

edgarbnt requested a review from Copilot May 13, 2026 22:23

Copilot started reviewing on behalf of edgarbnt May 13, 2026 22:24 View session

Copilot AI reviewed May 13, 2026

View reviewed changes

Copilot started work on behalf of edgarbnt May 13, 2026 22:30 View session

Copilot AI and others added 2 commits May 13, 2026 23:15

fix: address PR review feedback for smoke assertions and tool alias d…

4977ff3

…edup Agent-Logs-Url: https://github.com/edgarbnt/ReactSentinel/sessions/6bcec9f4-64c6-453f-a101-1e067fcf3d78 Co-authored-by: edgarbnt <146716791+edgarbnt@users.noreply.github.com>

chore: finalize review feedback updates and validations

50d99c3

Agent-Logs-Url: https://github.com/edgarbnt/ReactSentinel/sessions/6bcec9f4-64c6-453f-a101-1e067fcf3d78 Co-authored-by: edgarbnt <146716791+edgarbnt@users.noreply.github.com>

Copilot finished work on behalf of edgarbnt May 13, 2026 23:24

edgarbnt merged commit 0a99635 into main May 14, 2026

edgarbnt deleted the sprint/sprint-16-debug-intelligence-upgrade branch May 14, 2026 08:48

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sprint/sprint 16 debug intelligence upgrade#36

Sprint/sprint 16 debug intelligence upgrade#36
edgarbnt merged 9 commits into
mainfrom
sprint/sprint-16-debug-intelligence-upgrade

edgarbnt commented May 13, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

edgarbnt commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

edgarbnt commented May 13, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

edgarbnt commented May 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants