Problem 🎯
Data flows through the system as bare primitives (`dict`, `list[dict]`, `str`, `int`) long after crossing I/O boundaries where it could be parsed into meaningful domain types. Examples:
- GitHub issue data: `open_autonomous_issues()`, `open_issues()`, `post_issues()`, `issue_context()`, `_apply_verdict()` all pass `dict` / `list[dict]`. The actual shape (which keys exist, their types) is implicit, determined by the `--json` field lists passed to `gh` and by conventions in the scan/groom loops (see the sketch after this list).
- Agent step output: `step()` returns `dict`. The JSON schema is defined in prompts but invisible to the type system. Callers access `raw["findings"]`, `clustered["clusters"]`, `reviewed["ready"]`, etc. with no static checking.
- Project/scan config: `load_project()` returns `dict`, scan blocks are `dict`, and fields like `project["repo"]` and `scan["type"]` are untyped strings.
- Identifiers and paths: project IDs, scan types, issue numbers, branch names, and labels are all bare `str` / `int` with no domain wrapper to distinguish them or constrain their values.
- Run metadata: the `metadata` dict built in the scan/fix/groom finales is assembled ad hoc with string keys.
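
For concreteness, here is a minimal sketch of parsing at the first of these boundaries, using a `TypedDict` that mirrors a `--json` field list. The field names (`number`, `title`, `labels`) and the exact `gh` invocation are illustrative assumptions, and `open_issues()` stands in for whichever wrapper actually runs the query:

```python
# A sketch, not the implementation: parse gh output into a declared shape
# right at the I/O boundary. The field list below is assumed for illustration.
import json
import subprocess
from typing import TypedDict


class IssueLabel(TypedDict):
    name: str


class Issue(TypedDict):
    number: int
    title: str
    labels: list[IssueLabel]


def open_issues(repo: str) -> list[Issue]:
    proc = subprocess.run(
        ["gh", "issue", "list", "--repo", repo, "--state", "open",
         "--json", "number,title,labels"],
        capture_output=True, text=True, check=True,
    )
    # json.loads returns Any; the annotation is the shape claim. A TypedDict
    # adds no runtime validation, a tradeoff noted under "Definition of done".
    return json.loads(proc.stdout)
```

With that in place, `issue["titel"]` or treating `issue["number"]` as a `str` becomes a static error at every downstream call site instead of a latent `KeyError`.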
In practice this mostly works: the codebase is small and conventions are consistent. But it means:
- Key typos and shape mismatches are silent at type-check time
- You have to trace through I/O call sites to know what a value actually contains
- Functions that need different subsets of fields share the same `dict` type
- Domain rules (e.g. "a scan type is one of these known strings") live in runtime checks or conventions, not in the type system; one way to lift such a rule is sketched below
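
As one illustration of moving that last rule into the type system: a `Literal` alias checked once at the boundary. The scan type values below are placeholders, not the project's actual ones:

```python
# A minimal sketch assuming hypothetical scan types; the real values would
# come from whatever the scan config currently accepts by convention.
from typing import Literal, cast, get_args

ScanType = Literal["security", "lint", "deps"]  # placeholder values


def parse_scan_type(raw: str) -> ScanType:
    # One runtime check where the config enters the program...
    if raw not in get_args(ScanType):
        raise ValueError(f"unknown scan type: {raw!r}")
    # ...and a static guarantee everywhere after it: passing an unchecked
    # str where a ScanType is expected fails type checking.
    return cast(ScanType, raw)
```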
Definition of done ✅
- I/O boundaries (GitHub API responses, agent step JSON output, project config loading, run metadata) parse into explicit domain types on entry. You know this is fixed when `ty check` catches a misspelled key, missing field, or wrong type at any call site
- Internal code works with validated, meaningfully labeled data, not raw dicts and strings
- The approach is chosen deliberately between `TypedDict`s, dataclasses, Pydantic, `NewType`, enums, etc., based on the tradeoffs at each boundary: JSON passthrough vs validated parsing vs runtime construction vs lightweight semantic tagging (see the sketch below)
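
To make that tradeoff space concrete, a hedged sketch of how the candidates divide up; all names here are illustrative, and choosing the right mechanism per boundary is exactly the work of this task:

```python
from dataclasses import dataclass
from typing import NewType, TypedDict


# JSON passthrough (e.g. agent step output): a TypedDict claims a shape at
# zero runtime cost, but trusts the producer.
class StepOutput(TypedDict):
    findings: list[dict]


# Validated parsing / runtime construction (e.g. project config): a frozen
# dataclass (or a Pydantic model, if declarative validation is wanted) makes
# construction the point where invariants are enforced.
@dataclass(frozen=True)
class Project:
    repo: str
    scans: list[dict]  # could itself become a parsed type


# Lightweight semantic tagging (e.g. identifiers): NewType distinguishes the
# many bare str/int values with no runtime wrapper at all.
IssueNumber = NewType("IssueNumber", int)
BranchName = NewType("BranchName", str)
```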
Out of scope ❌
- Typing the `gh()` wrapper itself (it returns `CompletedProcess`, which is fine)
- Typing third-party library internals
- Changing the JSON schemas in agent prompts: this is about parsing what already exists, not redesigning the agent protocol