whyx — Static & Dynamic Code Path Explorer

whyx is a minimal, open‑source CLI for exploring Python codebases in two complementary ways:

Static: build a call graph from your source with a fast AST pass; query callers, callees, and find paths between functions.
Dynamic: run scripts under lightweight tracing to capture calls, returns, attribute assignments (via watchpoints), and derive coverage; then diff traces between runs and search within them.

Works entirely from the command line. No external services. JSON in, JSON out (if you want).

Jump to the diagram ↓

Features

✅ Static call graph from your project (no runtime needed)
✅ Call path queries: callers, callees (direct or transitive), and find-path
✅ Dynamic execution tracing: records call / return events
✅ Watchpoints for class attributes: record every assignment to e.g. models.User.age
✅ Trace diff between two JSON traces
✅ Trace search (grep-like) and watch history by target
✅ Coverage list (modules touched) derived from call events
✅ Zero deps for core CLI — a plain Python standard library tool

Requirements

Python 3.9+ (3.11+ recommended)
macOS / Linux / Windows
No third‑party dependencies required by default

Environment variables used by the helper script:

PY_BIN — which Python to use (default: python3)
VENV_DIR — where to create a virtualenv (default: .venv)

Quickstart

If you’re inside the repository root:

# 1) Make the helper script executable
chmod +x ./run-whyx.sh

# 2) Build a static index of the current project
./run-whyx.sh index .

# 3) Trace the included demo and save events
./run-whyx.sh run --trace --watch demo.User.age -o trace.json demo.py

# 4) Search the trace for "assign" events mentioning 'age'
./run-whyx.sh query trace-search --file trace.json --type assign --contains age

# 5) Show watch history of the demo field
./run-whyx.sh query history demo.User.age --file trace.json

Prefer to call Python directly instead of the helper script?

python -m whyx.cli index .
python -m whyx.cli run --trace --watch demo.User.age -o trace.json demo.py

Installation

You can run whyx from source without installing a package:

# In repo root
python -m venv .venv
# macOS/Linux
source .venv/bin/activate
# Windows (PowerShell)
# .\.venv\Scripts\Activate.ps1

# Run the CLI directly
python -m whyx.cli --help

Or use the provided convenience script (creates/uses .venv automatically):

chmod +x ./run-whyx.sh
./run-whyx.sh --help

Architecture diagram

✅ The diagram below is a GitHub‑native Mermaid diagram; it renders directly on GitHub README pages.

flowchart TD
    U["Developer / CI"] -->|commands| CLI["whyx.cli"]

    subgraph Static["Static analysis"]
        CLI -->|index, callers, callees, find-path| SA["static_analysis.py"]
        SA --> IDX[".whyx_index.json"]
    end

    subgraph Dynamic["Dynamic tracing"]
        CLI -->|run, report, diff| DT["dynamic_tracing.py"]
        DT --> TR["trace.json: call, return, assign events"]
        TR --> Q1["query history"]
        TR --> Q2["trace-search"]
        TR --> RPT["report coverage"]
        DT --> MODS["modules touched"]
    end

    CLI --> OUT["stdout: text or JSON"]

Usage

Index (static analysis)

Build a static index over a project folder (recurses through .py files, skipping common virtualenv/cache dirs).

# Writes .whyx_index.json next to your project root by default
./run-whyx.sh index .

Output (text)

{
  "project": ".",
  "functions": 42,
  "edges": 133,
  "index_file": "./.whyx_index.json"
}

Use --json on any command to force JSON output to stdout.

./run-whyx.sh index . --json

Query (callers / callees / find-path / history / trace-search)

Callers — show all call chains that end at a target function/method:

./run-whyx.sh query callers mypkg.module.Foo.bar
# Optional: --max-depth 64 --limit 200

Callees — list direct or transitive callees of a function:

./run-whyx.sh query callees mypkg.module.make_widget
./run-whyx.sh query callees --transitive mypkg.module.make_widget

Find path — enumerate call paths from A → B (bounded DFS):

./run-whyx.sh query find-path --from A.mod.func --to B.mod.func --limit 50 --max-depth 32

History — show all recorded assignments to a watched attribute (from a trace file):

./run-whyx.sh query history models.User.age --file trace.json

Trace search — grep-like search through events (optionally filter by type):

./run-whyx.sh query trace-search --file trace.json --contains "card_token=None"
./run-whyx.sh query trace-search --file trace.json --type call --contains "mypkg.checkout"

All query commands will load an existing ./.whyx_index.json if present. If none exists, they build an in-memory index from --project (default .). You can also point at a saved index with --index path/to/index.json.

Run (dynamic tracing & watchpoints)

Execute a Python script while recording runtime events and/or watchpoint assignments. The trace is written to JSON.

# Trace all function calls/returns + watch specific class attributes
./run-whyx.sh run --trace --watch models.User.age --watch cart.Cart.total -o trace.json path/to/script.py

Flags:

--trace — record call and return events
--watch module.Class.attr — record assignments to a specific class attribute (repeatable)
--coverage — compute a list of top‑level modules touched (based on call events)
-o, --output — where to save the trace (default: ./whyx_trace.json)
final positional arg — script to execute (e.g., demo.py)

Diff traces

Compare two trace files and list changes in call edges, return values, and watch histories.

./run-whyx.sh diff trace_before.json trace_after.json

Sample output (JSON):

{
  "added_calls": [["demo.workflow","demo.increment"]],
  "removed_calls": [],
  "changed_returns": {"demo.increment": {"old": ["1","2"], "new": ["2","4"]}},
  "watch_diffs": {"demo.User.age": {"old": ["0","1","2"], "new": ["0","2","4"]}}
}

Report coverage & top modules

Derive simple "which modules were touched" metrics from a trace file (count of call events per top‑level module).

./run-whyx.sh report trace.json --coverage
# Optionally limit output
./run-whyx.sh report trace.json --coverage --top 10

Legacy synonyms

For convenience, these still work:

# same as: query callers <FQN>
./run-whyx.sh callers mypkg.module.func

# same as: query callees <FQN>
./run-whyx.sh callees mypkg.module.func

# same as: query find-path --from A --to B
./run-whyx.sh findpath A.module.func B.module.func

# same as: query history --file <default> <target>
./run-whyx.sh history models.User.age

Data formats

Static index (`.whyx_index.json`)

{
  "root": "/abs/path/to/project",
  "generated_at": "2025-01-01T00:00:00Z",
  "functions": ["pkg.mod.Class.method", "pkg.mod.fn", "..."],
  "edges": [["callerFQN", "calleeFQN"], ["...", "..."]]
}

Dynamic trace (`trace.json`)

Each event is one of:

{"type": "call", "func": "module.Class.method"}
{"type": "return", "func": "module.Class.method", "value": "repr(value)"}
{"type": "assign", "target": "module.Class.attr", "func": "module.Class.method", "file": "rel/or/abs.py", "line": 123, "value": "repr(value)"}

[
  {"type": "call", "func": "demo.workflow"},
  {"type": "call", "func": "demo.increment"},
  {"type": "assign", "target": "demo.User.age", "func": "demo.increment", "file": "demo.py", "line": 6, "value": "2"},
  {"type": "return", "func": "demo.increment", "value": "2"},
  {"type": "return", "func": "demo.workflow", "value": "6"}
]

Watchpoint tips

Address by module name + class + attribute: module.Class.attr (e.g., models.User.age).
When tracing a script file (like demo.py), whyx treats the running script’s __main__ module as an alias of its file stem.
For demo.py, watch as demo.User.age even though frames may show __main__. This is automatic.
Watches attach lazily when classes become available at runtime. If a class doesn’t exist yet on first import, the tool keeps retrying until it can patch it.
Assignments are captured by wrapping the class’s __setattr__; the original behavior is restored at the end of execution.

Demo

This repository ships with a tiny demo in demo.py:

class User:
    def __init__(self):
        self.age = 0

def increment(u):
    u.age += 2   # changed increment to +2
    return u.age

def workflow():
    u = User()
    a1 = increment(u)
    a2 = increment(u)
    return a1 + a2

if __name__ == "__main__":
    print(workflow())

Try it end‑to‑end:

# Build the static index (optional for dynamic runs)
./run-whyx.sh index .

# Run with tracing + a watch on the User.age attribute
./run-whyx.sh run --trace --watch demo.User.age -o trace.json demo.py

# Show the watch history
./run-whyx.sh query history demo.User.age --file trace.json

# See which modules were touched
./run-whyx.sh report trace.json --coverage

Contributing

Contributions are welcome! A quick guideline:

Fork the repo and create your feature branch:
```
git checkout -b feat/my-idea
```

Set up a venv and run the CLI locally:

python -m venv .venv
# macOS/Linux
source .venv/bin/activate
# Windows
# .\.venv\Scripts\Activate.ps1

python -m whyx.cli --help

Add tests if you introduce new behavior (even small demos are fine).
Keep output deterministic where possible (e.g., sort lists before printing).
Open a PR with a clear description, motivation, and before/after examples.
For fixes, include a minimal repro if applicable.
By contributing, you agree that your code will be released under this project’s license (see below).

If you’re unsure about an approach, open a GitHub Discussion or Issue first.

Troubleshooting

"No module named whyx"
Make sure you are running from the repo root or that the repo is on PYTHONPATH. Prefer ./run-whyx.sh which sets up .venv and runs the module.
Watch didn’t capture anything
Double‑check the target string is module.Class.attr. When tracing a script (e.g., demo.py), use the file stem (demo) as the module part.
Trace file not written
Use -o trace.json or check write permissions for the working directory.
Large projects
Static indexing is lightweight, but for very large codebases consider narrowing the root path to speed things up.

Name		Name	Last commit message	Last commit date
Latest commit History 97 Commits
lab		lab
packaging		packaging
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
GOVERNANCE.md		GOVERNANCE.md
LICENSE.md		LICENSE.md
MAINTAINERS.md		MAINTAINERS.md
Makefile		Makefile
NOTICE.md		NOTICE.md
README.md		README.md
SECURITY.md		SECURITY.md
SUPPORT.md		SUPPORT.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

whyx — Static & Dynamic Code Path Explorer

Table of Contents

Features

Requirements

Quickstart

Installation

Architecture diagram

Usage

Index (static analysis)

Query (callers / callees / find-path / history / trace-search)

Run (dynamic tracing & watchpoints)

Diff traces

Report coverage & top modules

Legacy synonyms

Data formats

Static index (`.whyx_index.json`)

Dynamic trace (`trace.json`)

Watchpoint tips

Demo

Contributing

Troubleshooting

About

Uh oh!

Releases

Packages

Languages

License

mrakbook/whyx

Folders and files

Latest commit

History

Repository files navigation

whyx — Static & Dynamic Code Path Explorer

Table of Contents

Features

Requirements

Quickstart

Installation

Architecture diagram

Usage

Index (static analysis)

Query (callers / callees / find-path / history / trace-search)

Run (dynamic tracing & watchpoints)

Diff traces

Report coverage & top modules

Legacy synonyms

Data formats

Static index (.whyx_index.json)

Dynamic trace (trace.json)

Watchpoint tips

Demo

Contributing

Troubleshooting

About

Resources

License

Code of conduct

Contributing

Security policy

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Static index (`.whyx_index.json`)

Dynamic trace (`trace.json`)

Packages