From fe3f0a30d26d77e2093aa2ade0a7a4be1361399c Mon Sep 17 00:00:00 2001
From: Jesse Turner <ajesstur@amazon.com>
Date: Mon, 11 May 2026 13:34:54 +0000
Subject: [PATCH 1/3] docs: split TESTING.md into per-type docs with manual
 testing requirement

Break the monolithic TESTING.md into focused files under docs/testing/
(unit, integration, TUI, browser, e2e, manual) and add a requirement
that every change must be manually tested before submitting.
---
 docs/TESTING.md                   | 445 +-----------------------------
 docs/testing/browser-tests.md     |  74 +++++
 docs/testing/e2e-tests.md         |  22 ++
 docs/testing/integration-tests.md |  23 ++
 docs/testing/manual-testing.md    |  33 +++
 docs/testing/tui-tests.md         | 220 +++++++++++++++
 docs/testing/unit-tests.md        | 107 +++++++
 7 files changed, 489 insertions(+), 435 deletions(-)
 create mode 100644 docs/testing/browser-tests.md
 create mode 100644 docs/testing/e2e-tests.md
 create mode 100644 docs/testing/integration-tests.md
 create mode 100644 docs/testing/manual-testing.md
 create mode 100644 docs/testing/tui-tests.md
 create mode 100644 docs/testing/unit-tests.md

diff --git a/docs/TESTING.md b/docs/TESTING.md
index 601cf258f..24c914f69 100644
--- a/docs/TESTING.md
+++ b/docs/TESTING.md
@@ -11,414 +11,19 @@ npm run test:browser  # Run browser tests (requires AWS creds, uv, agentcore)
 npm run test:all      # Run all tests (unit + integ)
 ```
 
-## Test Organization
+## Test Types
 
-### Unit Tests
+| Type | Description | Docs |
+| ---- | ----------- | ---- |
+| Unit | Co-located tests for individual modules, includes snapshot tests | [testing/unit-tests.md](testing/unit-tests.md) |
+| Integration | Runs the real CLI binary, asserts on local files and stdout (no AWS creds needed) | [testing/integration-tests.md](testing/integration-tests.md) |
+| TUI | Full CLI in a pseudo-terminal — verifies screen output, keyboard navigation, wizard flows | [testing/tui-tests.md](testing/tui-tests.md) |
+| Browser | Playwright tests for the agent inspector web UI served by `agentcore dev` | [testing/browser-tests.md](testing/browser-tests.md) |
+| E2E | Full user journey across the AWS boundary — deploy, invoke, status, logs, traces | [testing/e2e-tests.md](testing/e2e-tests.md) |
 
-Unit tests are co-located with source files in `__tests__/` directories:
+## Manual Testing
 
-```
-src/cli/commands/add/
-├── action.ts
-├── command.ts
-└── __tests__/
-    └── add.test.ts
-```
-
-### Integration Tests
-
-Integration tests live in `integ-tests/`:
-
-```
-integ-tests/
-├── create-no-agent.test.ts
-├── create-with-agent.test.ts
-├── deploy.test.ts
-└── ...
-```
-
-See [integ-tests/README.md](../integ-tests/README.md) for integration test details.
-
-### E2E Tests
-
-E2E tests live in `e2e-tests/` and verify the full user journey across the AWS boundary — deploy, invoke, status, logs,
-traces, and control plane API calls.
-
-```
-e2e-tests/
-├── e2e-helper.ts           # Shared utilities and createE2ESuite() factory
-├── strands-bedrock.test.ts
-├── langgraph-openai.test.ts
-└── ...
-```
-
-See [e2e-tests/README.md](../e2e-tests/README.md) for e2e test details.
-
-## Writing Tests
-
-### Imports
-
-Use vitest for all test utilities:
-
-```typescript
-import { afterEach, beforeEach, describe, expect, it, vi } from 'vitest';
-```
-
-### Assertions
-
-Use `expect` assertions:
-
-```typescript
-// Equality
-expect(result).toBe('expected');
-expect(obj).toEqual({ key: 'value' });
-
-// Truthiness
-expect(value).toBeTruthy();
-expect(value).toBeFalsy();
-
-// Errors
-expect(() => fn()).toThrow();
-expect(() => fn()).toThrow('message');
-```
-
-### Mocking
-
-Use `vi` for mocks:
-
-```typescript
-// Mock functions
-const mockFn = vi.fn();
-mockFn.mockReturnValue('value');
-mockFn.mockResolvedValue('async value');
-
-// Spies
-vi.spyOn(module, 'method');
-
-// Module mocks
-vi.mock('./module');
-```
-
-## Test Utilities
-
-### CLI Runner
-
-`src/test-utils/cli-runner.ts` runs CLI commands in tests:
-
-```typescript
-import { runCLI } from '../src/test-utils/cli-runner';
-
-const result = await runCLI(['create', '--name', 'test'], tempDir);
-expect(result.exitCode).toBe(0);
-```
-
-## Snapshot Tests
-
-The `src/assets/` directory contains template files vended to users when they create projects. Snapshot tests ensure
-these templates don't change unexpectedly.
-
-### Running Snapshot Tests
-
-Snapshot tests run as part of unit tests:
-
-```bash
-npm test           # Runs all unit tests including snapshots
-npm run test:unit  # Same as above
-```
-
-### Updating Snapshots
-
-When you intentionally modify asset files (templates, configs, etc.), update snapshots:
-
-```bash
-npm run test:update-snapshots
-```
-
-Review the changes in `src/assets/__tests__/__snapshots__/` before committing.
-
-### What's Tested
-
-- File structure of `src/assets/`
-- Contents of all template files (CDK, Python frameworks, MCP, static assets)
-- Any file addition or removal
-
-## TUI Integration Tests
-
-TUI integration tests run the full CLI binary inside a pseudo-terminal (PTY) and verify screen output, keyboard
-navigation, and end-to-end wizard flows.
-
-> **Note:** TUI tests require `node-pty` (native addon). If node-pty is not installed, TUI tests are automatically
-> skipped.
-
-### Running TUI Tests
-
-```bash
-npm run test:tui              # Builds first, then runs TUI tests
-npx vitest run --project tui  # Skip build (use when build is fresh)
-```
-
-### Test Organization
-
-```
-integ-tests/tui/
-├── setup.ts              # Global setup: availability check, afterAll cleanup
-├── helpers.ts            # createMinimalProjectDir, common test setup
-├── harness.test.ts       # TuiSession self-tests (spawn, send, read)
-├── navigation.test.ts    # Screen navigation flows
-├── create-flow.test.ts   # Create wizard end-to-end
-├── add-flow.test.ts      # Add resource flows
-└── deploy-screen.test.ts # Deploy screen rendering
-```
-
-### Writing a TUI Flow Test
-
-Below is a complete example showing the typical pattern for a TUI flow test:
-
-```typescript
-import { isAvailable } from '../../src/test-utils/tui-harness/index.js';
-import { TuiSession } from '../../src/test-utils/tui-harness/index.js';
-import { createMinimalProjectDir } from './helpers.js';
-import { afterEach, describe, expect, it } from 'vitest';
-
-describe.skipIf(!isAvailable)('my TUI flow', () => {
-  let session: TuiSession;
-
-  afterEach(async () => {
-    await session?.close();
-  });
-
-  it('navigates to the add screen', async () => {
-    // createMinimalProjectDir makes a temp dir with agentcore config (~10ms)
-    const { dir, cleanup } = await createMinimalProjectDir({ hasAgents: true });
-
-    try {
-      // Launch the CLI TUI in the project directory
-      session = await TuiSession.launch({
-        command: 'node',
-        args: ['../../dist/cli/index.mjs'],
-        cwd: dir,
-      });
-
-      // Wait for the HelpScreen to render
-      await session.waitFor('Commands');
-
-      // Navigate: type 'add' to filter, then Enter
-      await session.sendKeys('add');
-      await session.sendSpecialKey('enter');
-
-      // Verify we reached the AddScreen
-      await session.waitFor('agent');
-      const screen = session.readScreen();
-      expect(screen.lines.join('\n')).toContain('agent');
-    } finally {
-      await cleanup();
-    }
-  });
-});
-```
-
-Key points:
-
-- **`describe.skipIf(!isAvailable)`** -- gracefully skips when `node-pty` is missing.
-- **`afterEach` with `session?.close()`** -- always clean up PTY processes.
-- **`createMinimalProjectDir`** -- fast temp directory setup (no `npm install`).
-- **`try/finally` with `cleanup()`** -- always remove temp directories.
-
-### TuiSession API Quick Reference
-
-| Method                                 | Returns                | Description                                                                                  |
-| -------------------------------------- | ---------------------- | -------------------------------------------------------------------------------------------- |
-| `TuiSession.launch(options)`           | `Promise<TuiSession>`  | Spawn CLI in PTY. Throws `LaunchError` if process exits during startup.                      |
-| `session.sendKeys(text, waitMs?)`      | `Promise<ScreenState>` | Type text, wait for screen to settle, return screen.                                         |
-| `session.sendSpecialKey(key, waitMs?)` | `Promise<ScreenState>` | Send special key (enter, tab, escape, etc.), wait, return screen.                            |
-| `session.readScreen(options?)`         | `ScreenState`          | Read current screen (synchronous). Options: `{ includeScrollback?, numbered? }`.             |
-| `session.waitFor(pattern, timeoutMs?)` | `Promise<ScreenState>` | Wait for text/regex on screen. **Throws `WaitForTimeoutError` on timeout** (default 5000ms). |
-| `session.close(signal?)`               | `Promise<CloseResult>` | Close session. Returns exit code, signal, final screen.                                      |
-| `session.info`                         | `SessionInfo`          | Session metadata: sessionId, pid, dimensions, alive status.                                  |
-| `session.alive`                        | `boolean`              | Whether the PTY process is still running.                                                    |
-
-### ScreenState Shape
-
-```typescript
-interface ScreenState {
-  lines: string[]; // Each line of terminal text
-  cursor: { x: number; y: number }; // Cursor position
-  dimensions: { cols: number; rows: number }; // Terminal size
-  bufferType: 'normal' | 'alternate'; // Active buffer
-}
-```
-
-### Special Keys
-
-The following special keys can be passed to `session.sendSpecialKey()`:
-
-`enter`, `tab`, `escape`, `backspace`, `delete`, `space`, `up`, `down`, `left`, `right`, `home`, `end`, `pageup`,
-`pagedown`, `ctrl+c`, `ctrl+d`, `ctrl+q`, `ctrl+g`, `ctrl+a`, `ctrl+e`, `ctrl+w`, `ctrl+u`, `ctrl+k`, `f1` through
-`f12`.
-
-### Key Concepts
-
-#### waitFor vs Settling
-
-- **Settling** (automatic after `sendKeys`/`sendSpecialKey`): Waits for screen text to stop changing. Good for most
-  screens. Fails on spinner/animation screens because text changes continuously.
-- **waitFor**: Polls for a specific text pattern. Use for: (a) async operations with spinners, (b) confirming you
-  reached the right screen, (c) any case where you need a specific pattern before proceeding.
-- **Rule of thumb**: Use `waitFor` when waiting for an async result (project creation, deployment). Use
-  `sendKeys`/`sendSpecialKey` (which auto-settle) for navigating between static screens.
-
-#### waitFor Throws on Timeout
-
-`waitFor()` throws `WaitForTimeoutError` when the pattern is not found within the timeout. The error includes:
-
-- The pattern that was not found
-- How long it waited
-- The full screen content at timeout
-
-This means tests fail fast with useful diagnostics. You do not need to check a `found` boolean.
-
-#### WaitForTimeoutError Output
-
-When `waitFor()` times out, the thrown `WaitForTimeoutError` produces a message like this:
-
-```
-WaitForTimeoutError: waitFor("created successfully") timed out after 5000ms.
-Screen content:
-AgentCore Create
-
-Creating project...
-⠋ Installing dependencies
-```
-
-The error message includes the full non-blank screen content at the time of the timeout. This makes it straightforward
-to diagnose why the expected pattern was not found -- was the screen still loading? Did the test land on the wrong
-screen? Was there a typo in the pattern?
-
-If you need to inspect the error properties programmatically (for example, to log additional context or make assertions
-on the screen state), you can catch the error directly:
-
-```typescript
-import { WaitForTimeoutError } from '../../src/test-utils/tui-harness/index.js';
-
-try {
-  await session.waitFor('expected text', 3000);
-} catch (err) {
-  if (err instanceof WaitForTimeoutError) {
-    console.log(err.pattern); // 'expected text'
-    console.log(err.elapsed); // ~3000
-    console.log(err.screen); // ScreenState with full content
-  }
-  throw err;
-}
-```
-
-#### createMinimalProjectDir
-
-Creates a temp directory that AgentCore recognizes as a project in ~10ms (no npm install). Use it when your test needs a
-project context:
-
-```typescript
-const { dir, cleanup } = await createMinimalProjectDir({
-  projectName: 'mytest', // optional, defaults to 'testproject'
-  hasAgents: true, // optional, adds a sample agent
-});
-```
-
-Always call `cleanup()` when done (in `finally` or `afterEach`).
-
-#### LaunchError
-
-`TuiSession.launch()` throws `LaunchError` when the spawned process exits before the screen settles. Common causes
-include a missing binary, a crash on startup, or an invalid working directory.
-
-The error includes the following diagnostic properties:
-
-- `command` -- the executable that was launched
-- `args` -- the arguments passed to the command
-- `cwd` -- the working directory used for the spawned process
-- `exitCode` -- the process exit code (or `null` if terminated by signal)
-- `screen` -- the `ScreenState` captured at the time of exit
-
-You can assert that a launch fails with `LaunchError`:
-
-```typescript
-import { LaunchError, TuiSession } from '../../src/test-utils/tui-harness/index.js';
-
-it('throws LaunchError for missing binary', async () => {
-  await expect(TuiSession.launch({ command: 'nonexistent-binary' })).rejects.toThrow(LaunchError);
-});
-
-// Or if you need to inspect the error:
-it('provides diagnostics in LaunchError', async () => {
-  try {
-    await TuiSession.launch({ command: 'node', args: ['missing-file.js'] });
-  } catch (err) {
-    if (err instanceof LaunchError) {
-      console.log(err.command); // 'node'
-      console.log(err.exitCode); // 1
-      console.log(err.screen); // ScreenState at time of crash
-    }
-    throw err;
-  }
-});
-```
-
-## Browser Tests
-
-Browser tests use Playwright to test the web UI (agent inspector) served by `agentcore dev`.
-
-### Prerequisites
-
-- AWS credentials configured (`aws sts get-caller-identity` must succeed)
-- `uv` on PATH
-- Local build (`npm run build`)
-- Playwright browsers installed: `npx playwright install chromium`
-
-### Running
-
-```bash
-npm run test:browser
-```
-
-Test results and the HTML report are written to `browser-tests/test-results/` and `browser-tests/playwright-report/`
-respectively. To view the report:
-
-```bash
-npx playwright show-report browser-tests/playwright-report
-```
-
-By default, tests run against the `@aws/agent-inspector` package from npm (in `node_modules`).
-
-### Testing against a local agent-inspector build
-
-To test with a local checkout of the agent-inspector (e.g. when developing new UI features or adding test IDs):
-
-1. Clone `agent-inspector` as a sibling directory and build it
-2. Run with `AGENT_INSPECTOR_PATH`:
-
-```bash
-AGENT_INSPECTOR_PATH=../agent-inspector/dist-assets npm run test:browser
-```
-
-### Test structure
-
-```
-browser-tests/
-├── playwright.config.ts  # Playwright configuration
-├── global-setup.ts       # Creates test project, starts agentcore dev
-├── global-teardown.ts    # Stops dev server, cleans up temp files
-├── constants.ts          # Shared constants (env file path)
-├── fixtures.ts           # Custom test fixtures (testEnv with port, project path)
-└── tests/                # Test files
-    ├── chat-invocation.test.ts
-    ├── inspector-loads.test.ts
-    ├── resources.test.ts
-    ├── start-agent.test.ts
-    └── traces.test.ts
-```
-
-The global setup creates a temporary project via `agentcore create`, starts `agentcore dev`, and writes connection
-details to an env file. Tests read the env file via the `testEnv` fixture.
+Every change must be manually tested before submitting. See [testing/manual-testing.md](testing/manual-testing.md) for instructions on building a local tarball and installing it without conflicting with global installs.
 
 ## Configuration
 
@@ -429,33 +34,3 @@ Test configuration is in `vitest.config.ts` using Vitest projects:
 - **tui** project: `integ-tests/tui/**/*.test.ts` (TUI integration tests)
 - Test timeout: 120 seconds
 - Hook timeout: 120 seconds
-
-## Troubleshooting
-
-### `Cannot find module '@playwright/test'`
-
-Playwright is not installed. Run:
-
-```bash
-npm install
-```
-
-### `browserType.launch: Executable doesn't exist` (Playwright browsers)
-
-Playwright browsers need to be downloaded after install. Run:
-
-```bash
-npx playwright install chromium
-```
-
-## Integration Tests
-
-Integration tests require no AWS credentials. They run the real CLI binary and assert on local files and stdout only.
-
-Run integration tests:
-
-```bash
-npm run test:integ
-```
-
-See [integ-tests/README.md](../integ-tests/README.md) for full details.
diff --git a/docs/testing/browser-tests.md b/docs/testing/browser-tests.md
new file mode 100644
index 000000000..81eb97146
--- /dev/null
+++ b/docs/testing/browser-tests.md
@@ -0,0 +1,74 @@
+# Browser Tests
+
+Browser tests use Playwright to test the web UI (agent inspector) served by `agentcore dev`.
+
+## Prerequisites
+
+- AWS credentials configured (`aws sts get-caller-identity` must succeed)
+- `uv` on PATH
+- Local build (`npm run build`)
+- Playwright browsers installed: `npx playwright install chromium`
+
+## Running
+
+```bash
+npm run test:browser
+```
+
+Test results and the HTML report are written to `browser-tests/test-results/` and `browser-tests/playwright-report/`
+respectively. To view the report:
+
+```bash
+npx playwright show-report browser-tests/playwright-report
+```
+
+By default, tests run against the `@aws/agent-inspector` package from npm (in `node_modules`).
+
+## Testing against a local agent-inspector build
+
+To test with a local checkout of the agent-inspector (e.g. when developing new UI features or adding test IDs):
+
+1. Clone `agent-inspector` as a sibling directory and build it
+2. Run with `AGENT_INSPECTOR_PATH`:
+
+```bash
+AGENT_INSPECTOR_PATH=../agent-inspector/dist-assets npm run test:browser
+```
+
+## Test Structure
+
+```
+browser-tests/
+├── playwright.config.ts  # Playwright configuration
+├── global-setup.ts       # Creates test project, starts agentcore dev
+├── global-teardown.ts    # Stops dev server, cleans up temp files
+├── constants.ts          # Shared constants (env file path)
+├── fixtures.ts           # Custom test fixtures (testEnv with port, project path)
+└── tests/                # Test files
+    ├── chat-invocation.test.ts
+    ├── inspector-loads.test.ts
+    ├── resources.test.ts
+    ├── start-agent.test.ts
+    └── traces.test.ts
+```
+
+The global setup creates a temporary project via `agentcore create`, starts `agentcore dev`, and writes connection
+details to an env file. Tests read the env file via the `testEnv` fixture.
+
+## Troubleshooting
+
+### `Cannot find module '@playwright/test'`
+
+Playwright is not installed. Run:
+
+```bash
+npm install
+```
+
+### `browserType.launch: Executable doesn't exist` (Playwright browsers)
+
+Playwright browsers need to be downloaded after install. Run:
+
+```bash
+npx playwright install chromium
+```
diff --git a/docs/testing/e2e-tests.md b/docs/testing/e2e-tests.md
new file mode 100644
index 000000000..dfd80a5bc
--- /dev/null
+++ b/docs/testing/e2e-tests.md
@@ -0,0 +1,22 @@
+# E2E Tests
+
+E2E tests verify the full user journey across the AWS boundary — deploy, invoke, status, logs, traces, and control
+plane API calls.
+
+## Running
+
+```bash
+npm run test:all      # Run all tests (unit + integ)
+```
+
+## Test Organization
+
+```
+e2e-tests/
+├── e2e-helper.ts           # Shared utilities and createE2ESuite() factory
+├── strands-bedrock.test.ts
+├── langgraph-openai.test.ts
+└── ...
+```
+
+See [e2e-tests/README.md](../../e2e-tests/README.md) for full details.
diff --git a/docs/testing/integration-tests.md b/docs/testing/integration-tests.md
new file mode 100644
index 000000000..e62d5184f
--- /dev/null
+++ b/docs/testing/integration-tests.md
@@ -0,0 +1,23 @@
+# Integration Tests
+
+Integration tests require no AWS credentials. They run the real CLI binary and assert on local files and stdout only.
+
+## Running
+
+```bash
+npm run test:integ    # Run integration tests
+```
+
+## Test Organization
+
+Integration tests live in `integ-tests/`:
+
+```
+integ-tests/
+├── create-no-agent.test.ts
+├── create-with-agent.test.ts
+├── deploy.test.ts
+└── ...
+```
+
+See [integ-tests/README.md](../../integ-tests/README.md) for full details.
diff --git a/docs/testing/manual-testing.md b/docs/testing/manual-testing.md
new file mode 100644
index 000000000..a8f95fd22
--- /dev/null
+++ b/docs/testing/manual-testing.md
@@ -0,0 +1,33 @@
+# Manual Testing
+
+## Building a local tarball
+
+Run `npm run bundle` from the agentcore-cli directory. This bundles the CLI along with the CDK constructs from the sister repo (`agentcore-l3-cdk-constructs`) into a single installable tarball.
+
+```bash
+cd agentcore-cli
+npm run bundle
+```
+
+## Installing locally (without conflicting with global installs)
+
+Install the tarball into your working directory so it doesn't conflict with other `agentcore` commands on the machine:
+
+```bash
+# From the parent workspace directory
+npm init -y  # if no package.json exists yet
+npm install ./agentcore-cli/agentcore-cli-*.tgz
+```
+
+Then run it with:
+
+```bash
+npx agentcore
+```
+
+Or add `node_modules/.bin` to your PATH for this directory only:
+
+```bash
+export PATH="$(pwd)/node_modules/.bin:$PATH"
+agentcore
+```
diff --git a/docs/testing/tui-tests.md b/docs/testing/tui-tests.md
new file mode 100644
index 000000000..1ea6fdeca
--- /dev/null
+++ b/docs/testing/tui-tests.md
@@ -0,0 +1,220 @@
+# TUI Integration Tests
+
+TUI integration tests run the full CLI binary inside a pseudo-terminal (PTY) and verify screen output, keyboard
+navigation, and end-to-end wizard flows.
+
+> **Note:** TUI tests require `node-pty` (native addon). If node-pty is not installed, TUI tests are automatically
+> skipped.
+
+## Running
+
+```bash
+npm run test:tui              # Builds first, then runs TUI tests
+npx vitest run --project tui  # Skip build (use when build is fresh)
+```
+
+## Test Organization
+
+```
+integ-tests/tui/
+├── setup.ts              # Global setup: availability check, afterAll cleanup
+├── helpers.ts            # createMinimalProjectDir, common test setup
+├── harness.test.ts       # TuiSession self-tests (spawn, send, read)
+├── navigation.test.ts    # Screen navigation flows
+├── create-flow.test.ts   # Create wizard end-to-end
+├── add-flow.test.ts      # Add resource flows
+└── deploy-screen.test.ts # Deploy screen rendering
+```
+
+## Writing a TUI Flow Test
+
+Below is a complete example showing the typical pattern for a TUI flow test:
+
+```typescript
+import { isAvailable } from '../../src/test-utils/tui-harness/index.js';
+import { TuiSession } from '../../src/test-utils/tui-harness/index.js';
+import { createMinimalProjectDir } from './helpers.js';
+import { afterEach, describe, expect, it } from 'vitest';
+
+describe.skipIf(!isAvailable)('my TUI flow', () => {
+  let session: TuiSession;
+
+  afterEach(async () => {
+    await session?.close();
+  });
+
+  it('navigates to the add screen', async () => {
+    // createMinimalProjectDir makes a temp dir with agentcore config (~10ms)
+    const { dir, cleanup } = await createMinimalProjectDir({ hasAgents: true });
+
+    try {
+      // Launch the CLI TUI in the project directory
+      session = await TuiSession.launch({
+        command: 'node',
+        args: ['../../dist/cli/index.mjs'],
+        cwd: dir,
+      });
+
+      // Wait for the HelpScreen to render
+      await session.waitFor('Commands');
+
+      // Navigate: type 'add' to filter, then Enter
+      await session.sendKeys('add');
+      await session.sendSpecialKey('enter');
+
+      // Verify we reached the AddScreen
+      await session.waitFor('agent');
+      const screen = session.readScreen();
+      expect(screen.lines.join('\n')).toContain('agent');
+    } finally {
+      await cleanup();
+    }
+  });
+});
+```
+
+Key points:
+
+- **`describe.skipIf(!isAvailable)`** -- gracefully skips when `node-pty` is missing.
+- **`afterEach` with `session?.close()`** -- always clean up PTY processes.
+- **`createMinimalProjectDir`** -- fast temp directory setup (no `npm install`).
+- **`try/finally` with `cleanup()`** -- always remove temp directories.
+
+## TuiSession API Quick Reference
+
+| Method                                 | Returns                | Description                                                                                  |
+| -------------------------------------- | ---------------------- | -------------------------------------------------------------------------------------------- |
+| `TuiSession.launch(options)`           | `Promise<TuiSession>`  | Spawn CLI in PTY. Throws `LaunchError` if process exits during startup.                      |
+| `session.sendKeys(text, waitMs?)`      | `Promise<ScreenState>` | Type text, wait for screen to settle, return screen.                                         |
+| `session.sendSpecialKey(key, waitMs?)` | `Promise<ScreenState>` | Send special key (enter, tab, escape, etc.), wait, return screen.                            |
+| `session.readScreen(options?)`         | `ScreenState`          | Read current screen (synchronous). Options: `{ includeScrollback?, numbered? }`.             |
+| `session.waitFor(pattern, timeoutMs?)` | `Promise<ScreenState>` | Wait for text/regex on screen. **Throws `WaitForTimeoutError` on timeout** (default 5000ms). |
+| `session.close(signal?)`               | `Promise<CloseResult>` | Close session. Returns exit code, signal, final screen.                                      |
+| `session.info`                         | `SessionInfo`          | Session metadata: sessionId, pid, dimensions, alive status.                                  |
+| `session.alive`                        | `boolean`              | Whether the PTY process is still running.                                                    |
+
+## ScreenState Shape
+
+```typescript
+interface ScreenState {
+  lines: string[]; // Each line of terminal text
+  cursor: { x: number; y: number }; // Cursor position
+  dimensions: { cols: number; rows: number }; // Terminal size
+  bufferType: 'normal' | 'alternate'; // Active buffer
+}
+```
+
+## Special Keys
+
+The following special keys can be passed to `session.sendSpecialKey()`:
+
+`enter`, `tab`, `escape`, `backspace`, `delete`, `space`, `up`, `down`, `left`, `right`, `home`, `end`, `pageup`,
+`pagedown`, `ctrl+c`, `ctrl+d`, `ctrl+q`, `ctrl+g`, `ctrl+a`, `ctrl+e`, `ctrl+w`, `ctrl+u`, `ctrl+k`, `f1` through
+`f12`.
+
+## Key Concepts
+
+### waitFor vs Settling
+
+- **Settling** (automatic after `sendKeys`/`sendSpecialKey`): Waits for screen text to stop changing. Good for most
+  screens. Fails on spinner/animation screens because text changes continuously.
+- **waitFor**: Polls for a specific text pattern. Use for: (a) async operations with spinners, (b) confirming you
+  reached the right screen, (c) any case where you need a specific pattern before proceeding.
+- **Rule of thumb**: Use `waitFor` when waiting for an async result (project creation, deployment). Use
+  `sendKeys`/`sendSpecialKey` (which auto-settle) for navigating between static screens.
+
+### waitFor Throws on Timeout
+
+`waitFor()` throws `WaitForTimeoutError` when the pattern is not found within the timeout. The error includes:
+
+- The pattern that was not found
+- How long it waited
+- The full screen content at timeout
+
+This means tests fail fast with useful diagnostics. You do not need to check a `found` boolean.
+
+### WaitForTimeoutError Output
+
+When `waitFor()` times out, the thrown `WaitForTimeoutError` produces a message like this:
+
+```
+WaitForTimeoutError: waitFor("created successfully") timed out after 5000ms.
+Screen content:
+AgentCore Create
+
+Creating project...
+⠋ Installing dependencies
+```
+
+The error message includes the full non-blank screen content at the time of the timeout. This makes it straightforward
+to diagnose why the expected pattern was not found -- was the screen still loading? Did the test land on the wrong
+screen? Was there a typo in the pattern?
+
+If you need to inspect the error properties programmatically (for example, to log additional context or make assertions
+on the screen state), you can catch the error directly:
+
+```typescript
+import { WaitForTimeoutError } from '../../src/test-utils/tui-harness/index.js';
+
+try {
+  await session.waitFor('expected text', 3000);
+} catch (err) {
+  if (err instanceof WaitForTimeoutError) {
+    console.log(err.pattern); // 'expected text'
+    console.log(err.elapsed); // ~3000
+    console.log(err.screen); // ScreenState with full content
+  }
+  throw err;
+}
+```
+
+### createMinimalProjectDir
+
+Creates a temp directory that AgentCore recognizes as a project in ~10ms (no npm install). Use it when your test needs a
+project context:
+
+```typescript
+const { dir, cleanup } = await createMinimalProjectDir({
+  projectName: 'mytest', // optional, defaults to 'testproject'
+  hasAgents: true, // optional, adds a sample agent
+});
+```
+
+Always call `cleanup()` when done (in `finally` or `afterEach`).
+
+### LaunchError
+
+`TuiSession.launch()` throws `LaunchError` when the spawned process exits before the screen settles. Common causes
+include a missing binary, a crash on startup, or an invalid working directory.
+
+The error includes the following diagnostic properties:
+
+- `command` -- the executable that was launched
+- `args` -- the arguments passed to the command
+- `cwd` -- the working directory used for the spawned process
+- `exitCode` -- the process exit code (or `null` if terminated by signal)
+- `screen` -- the `ScreenState` captured at the time of exit
+
+You can assert that a launch fails with `LaunchError`:
+
+```typescript
+import { LaunchError, TuiSession } from '../../src/test-utils/tui-harness/index.js';
+
+it('throws LaunchError for missing binary', async () => {
+  await expect(TuiSession.launch({ command: 'nonexistent-binary' })).rejects.toThrow(LaunchError);
+});
+
+// Or if you need to inspect the error:
+it('provides diagnostics in LaunchError', async () => {
+  try {
+    await TuiSession.launch({ command: 'node', args: ['missing-file.js'] });
+  } catch (err) {
+    if (err instanceof LaunchError) {
+      console.log(err.command); // 'node'
+      console.log(err.exitCode); // 1
+      console.log(err.screen); // ScreenState at time of crash
+    }
+    throw err;
+  }
+});
+```
diff --git a/docs/testing/unit-tests.md b/docs/testing/unit-tests.md
new file mode 100644
index 000000000..6c3a09fd7
--- /dev/null
+++ b/docs/testing/unit-tests.md
@@ -0,0 +1,107 @@
+# Unit Tests
+
+Unit tests are co-located with source files in `__tests__/` directories:
+
+```
+src/cli/commands/add/
+├── action.ts
+├── command.ts
+└── __tests__/
+    └── add.test.ts
+```
+
+## Running
+
+```bash
+npm test              # Run unit tests
+npm run test:watch    # Run tests in watch mode
+npm run test:unit     # Same as npm test
+```
+
+## Writing Tests
+
+### Imports
+
+Use vitest for all test utilities:
+
+```typescript
+import { afterEach, beforeEach, describe, expect, it, vi } from 'vitest';
+```
+
+### Assertions
+
+Use `expect` assertions:
+
+```typescript
+// Equality
+expect(result).toBe('expected');
+expect(obj).toEqual({ key: 'value' });
+
+// Truthiness
+expect(value).toBeTruthy();
+expect(value).toBeFalsy();
+
+// Errors
+expect(() => fn()).toThrow();
+expect(() => fn()).toThrow('message');
+```
+
+### Mocking
+
+Use `vi` for mocks:
+
+```typescript
+// Mock functions
+const mockFn = vi.fn();
+mockFn.mockReturnValue('value');
+mockFn.mockResolvedValue('async value');
+
+// Spies
+vi.spyOn(module, 'method');
+
+// Module mocks
+vi.mock('./module');
+```
+
+## Test Utilities
+
+### CLI Runner
+
+`src/test-utils/cli-runner.ts` runs CLI commands in tests:
+
+```typescript
+import { runCLI } from '../src/test-utils/cli-runner';
+
+const result = await runCLI(['create', '--name', 'test'], tempDir);
+expect(result.exitCode).toBe(0);
+```
+
+## Snapshot Tests
+
+The `src/assets/` directory contains template files vended to users when they create projects. Snapshot tests ensure
+these templates don't change unexpectedly.
+
+### Running Snapshot Tests
+
+Snapshot tests run as part of unit tests:
+
+```bash
+npm test           # Runs all unit tests including snapshots
+npm run test:unit  # Same as above
+```
+
+### Updating Snapshots
+
+When you intentionally modify asset files (templates, configs, etc.), update snapshots:
+
+```bash
+npm run test:update-snapshots
+```
+
+Review the changes in `src/assets/__tests__/__snapshots__/` before committing.
+
+### What's Tested
+
+- File structure of `src/assets/`
+- Contents of all template files (CDK, Python frameworks, MCP, static assets)
+- Any file addition or removal

From 7c4d17f2bdce835099925f5172264687ea4db723 Mon Sep 17 00:00:00 2001
From: Jesse Turner <ajesstur@amazon.com>
Date: Mon, 11 May 2026 13:39:31 +0000
Subject: [PATCH 2/3] fix: address review comments on testing docs

- e2e-tests.md: use correct `npm run test:e2e` command, add AWS creds prerequisite
- manual-testing.md: fix tarball glob to `aws-agentcore-*.tgz`
---
 docs/testing/e2e-tests.md      | 9 ++++++++-
 docs/testing/manual-testing.md | 2 +-
 2 files changed, 9 insertions(+), 2 deletions(-)

diff --git a/docs/testing/e2e-tests.md b/docs/testing/e2e-tests.md
index dfd80a5bc..a47977899 100644
--- a/docs/testing/e2e-tests.md
+++ b/docs/testing/e2e-tests.md
@@ -3,10 +3,17 @@
 E2E tests verify the full user journey across the AWS boundary — deploy, invoke, status, logs, traces, and control
 plane API calls.
 
+## Prerequisites
+
+- AWS credentials configured (`aws sts get-caller-identity` must succeed)
+- Local build (`npm run build`)
+
+See [e2e-tests/README.md](../../e2e-tests/README.md) for full prerequisite details.
+
 ## Running
 
 ```bash
-npm run test:all      # Run all tests (unit + integ)
+npm run test:e2e      # Run e2e tests
 ```
 
 ## Test Organization
diff --git a/docs/testing/manual-testing.md b/docs/testing/manual-testing.md
index a8f95fd22..3c3da3f87 100644
--- a/docs/testing/manual-testing.md
+++ b/docs/testing/manual-testing.md
@@ -16,7 +16,7 @@ Install the tarball into your working directory so it doesn't conflict with othe
 ```bash
 # From the parent workspace directory
 npm init -y  # if no package.json exists yet
-npm install ./agentcore-cli/agentcore-cli-*.tgz
+npm install ./agentcore-cli/aws-agentcore-*.tgz
 ```
 
 Then run it with:

From 3fa83b8c7aa09281f8ad4e1898a9995599cd2741 Mon Sep 17 00:00:00 2001
From: Jesse Turner <ajesstur@amazon.com>
Date: Mon, 11 May 2026 13:44:06 +0000
Subject: [PATCH 3/3] style: fix prettier formatting in testing docs

---
 docs/TESTING.md                | 17 +++++++++--------
 docs/testing/e2e-tests.md      |  4 ++--
 docs/testing/manual-testing.md |  3 ++-
 3 files changed, 13 insertions(+), 11 deletions(-)

diff --git a/docs/TESTING.md b/docs/TESTING.md
index 24c914f69..68bd9e7a0 100644
--- a/docs/TESTING.md
+++ b/docs/TESTING.md
@@ -13,17 +13,18 @@ npm run test:all      # Run all tests (unit + integ)
 
 ## Test Types
 
-| Type | Description | Docs |
-| ---- | ----------- | ---- |
-| Unit | Co-located tests for individual modules, includes snapshot tests | [testing/unit-tests.md](testing/unit-tests.md) |
-| Integration | Runs the real CLI binary, asserts on local files and stdout (no AWS creds needed) | [testing/integration-tests.md](testing/integration-tests.md) |
-| TUI | Full CLI in a pseudo-terminal — verifies screen output, keyboard navigation, wizard flows | [testing/tui-tests.md](testing/tui-tests.md) |
-| Browser | Playwright tests for the agent inspector web UI served by `agentcore dev` | [testing/browser-tests.md](testing/browser-tests.md) |
-| E2E | Full user journey across the AWS boundary — deploy, invoke, status, logs, traces | [testing/e2e-tests.md](testing/e2e-tests.md) |
+| Type        | Description                                                                               | Docs                                                         |
+| ----------- | ----------------------------------------------------------------------------------------- | ------------------------------------------------------------ |
+| Unit        | Co-located tests for individual modules, includes snapshot tests                          | [testing/unit-tests.md](testing/unit-tests.md)               |
+| Integration | Runs the real CLI binary, asserts on local files and stdout (no AWS creds needed)         | [testing/integration-tests.md](testing/integration-tests.md) |
+| TUI         | Full CLI in a pseudo-terminal — verifies screen output, keyboard navigation, wizard flows | [testing/tui-tests.md](testing/tui-tests.md)                 |
+| Browser     | Playwright tests for the agent inspector web UI served by `agentcore dev`                 | [testing/browser-tests.md](testing/browser-tests.md)         |
+| E2E         | Full user journey across the AWS boundary — deploy, invoke, status, logs, traces          | [testing/e2e-tests.md](testing/e2e-tests.md)                 |
 
 ## Manual Testing
 
-Every change must be manually tested before submitting. See [testing/manual-testing.md](testing/manual-testing.md) for instructions on building a local tarball and installing it without conflicting with global installs.
+Every change must be manually tested before submitting. See [testing/manual-testing.md](testing/manual-testing.md) for
+instructions on building a local tarball and installing it without conflicting with global installs.
 
 ## Configuration
 
diff --git a/docs/testing/e2e-tests.md b/docs/testing/e2e-tests.md
index a47977899..50a57803c 100644
--- a/docs/testing/e2e-tests.md
+++ b/docs/testing/e2e-tests.md
@@ -1,7 +1,7 @@
 # E2E Tests
 
-E2E tests verify the full user journey across the AWS boundary — deploy, invoke, status, logs, traces, and control
-plane API calls.
+E2E tests verify the full user journey across the AWS boundary — deploy, invoke, status, logs, traces, and control plane
+API calls.
 
 ## Prerequisites
 
diff --git a/docs/testing/manual-testing.md b/docs/testing/manual-testing.md
index 3c3da3f87..9b4539581 100644
--- a/docs/testing/manual-testing.md
+++ b/docs/testing/manual-testing.md
@@ -2,7 +2,8 @@
 
 ## Building a local tarball
 
-Run `npm run bundle` from the agentcore-cli directory. This bundles the CLI along with the CDK constructs from the sister repo (`agentcore-l3-cdk-constructs`) into a single installable tarball.
+Run `npm run bundle` from the agentcore-cli directory. This bundles the CLI along with the CDK constructs from the
+sister repo (`agentcore-l3-cdk-constructs`) into a single installable tarball.
 
 ```bash
 cd agentcore-cli