July Backtester

A professional-grade Python engine for stress-testing US equity strategies with Monte Carlo simulation and Walk-Forward Analysis.

Tests trading strategies against full historical US equity data, runs 1,000-path Monte Carlo simulation and Walk-Forward Analysis to separate genuine edges from curve-fitting, and produces a summary table with Sharpe, Calmar, Win Rate, MC Score, WFA Verdict, and SPY/QQQ outperformance. Detailed PDF tearsheets include equity curves, drawdown plots, R-Multiple histograms, and VIX regime heatmaps.

Intraday support: backtest on 5-minute, 15-minute, 30-minute, 1-hour (1H), or 4-hour (4H) bars. Sharpe, Sortino, and hard-to-borrow (HTB) fees are annualized automatically for the chosen bar size.
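
The annualization idea can be sketched as below. This is a minimal illustration, not the engine's code: the helper names `bars_per_year` and `annualized_sharpe`, the timeframe labels other than 1H and 4H, the 252-day year, and the 390-minute regular session are all assumptions.

```python
import numpy as np

TRADING_DAYS = 252       # assumed trading days per year
SESSION_MINUTES = 390    # assumed regular US session, 09:30 to 16:00

# Bar length in minutes; the labels other than 1H/4H are hypothetical.
BAR_MINUTES = {"5min": 5, "15min": 15, "30min": 30, "1H": 60, "4H": 240}


def bars_per_year(timeframe: str) -> float:
    """Approximate number of bars in one trading year for a timeframe."""
    if timeframe == "1D":
        return float(TRADING_DAYS)
    return TRADING_DAYS * SESSION_MINUTES / BAR_MINUTES[timeframe]


def annualized_sharpe(bar_returns, timeframe: str, risk_free_rate: float = 0.0) -> float:
    """Scale the per-bar Sharpe ratio by sqrt(bars per year)."""
    n = bars_per_year(timeframe)
    # Convert the annual risk-free rate to a per-bar rate before subtracting.
    excess = np.asarray(bar_returns, dtype=float) - risk_free_rate / n
    return float(excess.mean() / excess.std(ddof=1) * np.sqrt(n))
```

Under these assumptions a 30-minute series is scaled by the square root of 3,276 bars per year rather than 252, which is why the same per-bar Sharpe looks very different if the bar size is ignored.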

Supports Polygon, Norgate, Yahoo Finance, local CSV, and local Parquet. Free to run against Yahoo Finance with no API key.

Full reference: docs/README_full.md

Installation

git clone https://github.com/zachisit/july-backtester.git
cd july-backtester
python -m venv venv
source venv/bin/activate   # Windows: venv\Scripts\activate.bat
pip install -r requirements.txt

For Polygon data, add your API key to .env (copy .env.example to get started):

POLYGON_API_KEY=your_key_here

For interns with private strategies: After cloning, initialize the private strategies submodule:

git submodule update --init --recursive

See PRIVATE_STRATEGIES.md for the full guide.

Quick Start

%%{init: {'theme': 'neutral'}}%%
graph TD
    A[Clone Repository] --> B[Create & Activate Virtual Environment]
    B --> C[pip install -r requirements.txt]
    C --> D{Choose Data Provider}
    D -->|Polygon.io| E[Add POLYGON_API_KEY to .env file]
    D -->|Yahoo / Norgate / CSV / Parquet| F[No API Key Needed]
    E --> G[Run Setup Wizard: python main.py --init]
    F --> G
    G --> H[config.py written - Ready to Backtest]

First time? Run the setup wizard:

python main.py --init

graph TD
    A[Run Setup Wizard: python main.py --init] --> B{Select Data Provider}
    B -->|Polygon.io| C[Enter API Key]
    B -->|Yahoo / Norgate / CSV / Parquet| D[No API Key Needed]
    C --> E[Configure Account]
    D --> E
    E -->|Set Capital, Dates, & Slippage| F[Select Initial Portfolio]
    F -->|Choose Nasdaq 100, S&P 500, etc.| G[Wizard Writes config.py]
    G --> I[Setup Complete - Ready to Run]

Or manually — set these lines in config.py and run:

"data_provider": "yahoo",
"portfolios": {"My Symbols": ["SPY"]},
"start_date": "2010-01-01",
"initial_capital": 100000.0,

python main.py

The engine runs every strategy in custom_strategies/ against SPY, prints a results table, and writes output to output/runs/<timestamp>/.

Portfolio run — test all strategies against the Nasdaq 100:

"data_provider": "polygon",
"portfolios": {
    "Nasdaq 100": "nasdaq_100.json",
},

Validate before a long run: python main.py --dry-run

See examples/ for ready-to-use config files and annotated strategy examples.

Writing Strategies

Drop a .py file in custom_strategies/ and decorate your function with @register_strategy. The engine discovers and runs it automatically — no other files need editing.

Signal convention

Your function must populate df['Signal'] before returning. The engine interprets the values as:

| Signal | Meaning |
|---|---|
| `1` | Enter long / hold long |
| `0` | No change: hold whatever position is currently open |
| `-1` | Exit long or cover short (go flat) |
| `-2` | Enter short |

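A common pattern is to emit 1 or -1 on every bar, so each bar states explicitly whether the strategy is long or flat. Another is to emit 1 and -1 only on the bars where something changes and 0 everywhere else; in that case 0 carries the current state forward (still long after a 1, still flat after a -1). Use -2 only when your strategy has explicit short logic; all existing built-in strategies are long-only and never emit -2.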
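
To make the convention concrete, here is a small sketch that walks a signal sequence and tracks the resulting position. It is an illustration of the table only, not the engine's execution logic: `signals_to_positions` is a hypothetical helper, and it ignores fill timing, stops, and position sizing.

```python
def signals_to_positions(signals):
    """Map per-bar signals to the position held after each bar.

    Position codes used in this sketch: 1 = long, 0 = flat, -1 = short.
    """
    position = 0  # start flat
    positions = []
    for signal in signals:
        if signal == 1:
            position = 1    # enter long / hold long
        elif signal == -1:
            position = 0    # exit long or cover short
        elif signal == -2:
            position = -1   # enter short
        # signal == 0: no change, keep the current position
        positions.append(position)
    return positions


print(signals_to_positions([0, 1, 0, -1, 0, -2, 0, -1]))
# [0, 1, 1, 0, 0, -1, -1, 0]
```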

Minimal example

from helpers.registry import register_strategy

@register_strategy(name="My Strategy", dependencies=[], params={})
def my_strategy(df, **kwargs):
    ma20 = df["Close"].rolling(20).mean()       # NaN for the first 19 bars
    df["Signal"] = 0                            # warm-up bars stay at 0 (no change)
    df.loc[df["Close"] > ma20, "Signal"] = 1    # above 20MA → long
    df.loc[df["Close"] <= ma20, "Signal"] = -1  # at or below 20MA → flat
    return df

Dependencies (SPY, VIX)

Declare dependencies=["spy"] or dependencies=["vix"] to have the engine inject spy_df / vix_df into **kwargs automatically. See examples/strategies/ for annotated examples including dependency usage, parameter definitions, and the forward-fill pattern for discrete entry/exit signals.
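
A dependency-aware strategy might look like the sketch below. It assumes `spy_df` arrives in `**kwargs` with a `Close` column and a date index comparable to `df`'s; the strategy name, the 50-bar and 20-bar windows, and the fallback decorator (there only so the snippet runs outside the repo) are illustrative choices, not part of the project.

```python
import pandas as pd

try:
    from helpers.registry import register_strategy  # available inside the repo
except ImportError:
    # Stand-in so the sketch runs outside the repo; NOT the real decorator.
    def register_strategy(**_meta):
        return lambda fn: fn


@register_strategy(name="SPY Regime Filter (sketch)", dependencies=["spy"], params={})
def spy_regime_filter(df, **kwargs):
    df["Signal"] = 0
    spy_df = kwargs.get("spy_df")
    if spy_df is None:
        return df  # dependency missing: emit no signals

    # Risk-on when SPY closes above its own 50-bar average (assumed window).
    spy_ma = spy_df["Close"].rolling(50).mean()
    risk_on = (
        (spy_df["Close"] > spy_ma)
        .astype(float)
        .reindex(df.index, method="ffill")  # align SPY bars to this symbol's bars
        .fillna(0.0)
        .astype(bool)
    )

    own_ma = df["Close"].rolling(20).mean()
    df.loc[risk_on & (df["Close"] > own_ma), "Signal"] = 1      # long only in risk-on
    df.loc[~risk_on | (df["Close"] <= own_ma), "Signal"] = -1   # otherwise flat
    return df
```

During SPY's warm-up window the regime is treated as risk-off, so the sketch stays flat until both averages are defined.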

See CONTRIBUTING.md for the full plugin reference including parameters, timeframe utilities, and the PR checklist.

Norgate Data

If you have a Norgate license, you can either query Norgate live on every run or export the full database to local Parquet files once and share access with teammates who don't have a license.

| Setting | What it does | Requires |
|---|---|---|
| `data_provider: "norgate"` | Calls Norgate API live on every run | Norgate license + NDU running |
| `data_provider: "parquet"` | Reads pre-exported local Parquet files | Submodule only; no license needed |

Pipeline

%%{init: {'theme': 'neutral'}}%%
graph LR
    A[Norgate API\nNDU running] -->|norgate_to_parquet.py| B[(parquet_data/data/\n~36 000 .parquet files)]
    B -->|data_provider: parquet| C[Backtester]
    A -->|data_provider: norgate| C

Exporting Norgate data to Parquet

Run the three export commands once (full dump, ~36 000 symbols, ~2.5 GB):

python scripts/norgate_to_parquet.py --database "US Equities"          --output-dir parquet_data/data --start-date 1990-01-01
python scripts/norgate_to_parquet.py --database "US Equities Delisted" --output-dir parquet_data/data --start-date 1990-01-01 --skip-existing
python scripts/norgate_to_parquet.py --database "US Indices"           --output-dir parquet_data/data --start-date 1990-01-01 --skip-existing

Validate that every Norgate symbol has a local file:

python scripts/validate_norgate_export.py

See scripts/NORGATE_EXPORT.md for the full export and validation guide.

Accessing the exported data (interns / no-license teammates)

The exported dataset lives in the parquet_data/ git submodule (private repo: july-backtester-norgate-data). Clone it alongside the main repo:

git clone --recurse-submodules https://github.com/zachisit/july-backtester.git

Or, if you already cloned without --recurse-submodules:

git submodule update --init parquet_data

Then set data_provider: "parquet" in config.py. No Norgate license or NDU process required.
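
Before a long run it can help to check that the symbols you care about actually exist in the local export. The sketch below is a hypothetical helper, not a script shipped with the repo, and it assumes one file per symbol named `<SYMBOL>.parquet` under `parquet_data/data`; adjust it if the export uses a different naming scheme.

```python
from pathlib import Path


def missing_symbols(symbols, parquet_dir="parquet_data/data"):
    """Return the symbols that have no <SYMBOL>.parquet file in parquet_dir."""
    # Assumption: the file stem is the ticker symbol.
    available = {path.stem for path in Path(parquet_dir).glob("*.parquet")}
    return [symbol for symbol in symbols if symbol not in available]


if __name__ == "__main__":
    print(missing_symbols(["SPY", "QQQ", "AAPL"]))
```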

The Backtesting Lifecycle

%%{init: {'theme': 'neutral'}}%%
graph TD
    A[Edit config.py] -->|Set portfolios, dates, capital| B[Run: python main.py]
    B --> C{Execution Engine}
    C -->|Fetches/Loads Data| D[(Local Data Cache)]
    C -->|Calculates Edge| E[Monte Carlo & Walk-Forward Analysis]
    E --> F[Output Folder created: output/runs/RUN_ID/]
    F -->|Terminal Output| G[Summary Table & Correlation Matrix]
    F -->|Raw Trade Data| H[analyzer_csvs/ Portfolio / Strategy.csv]
    F -->|LLM-readable JSON| K[llm_verdict.json]
    H --> I[Run: python report.py --all output/runs/RUN_ID]
    I --> J[PDF & Markdown Reports generated in detailed_reports/]

LLM Verdict

Every run writes output/runs/<run_id>/llm_verdict.json — a machine-readable file designed to let an LLM, script, or downstream tool evaluate strategy results without parsing terminal output.

What it contains

Each strategy entry includes:

| Field | Description |
|---|---|
| `beats_spy` | `true` / `false` boolean |
| `verdict` | Plain-English string, e.g. `"BEATS SPY by +81684.50pp"` or `"LAGS SPY by -788.75pp"` |
| `strategy_return_pct` | Total return as a float |
| `benchmarks` | Per-benchmark breakdown: `bh_return_pct`, `beats`, `outperformance_pp` |
| `sharpe_ratio`, `calmar_ratio`, `max_drawdown_pct`, `win_rate_pct` | Key risk metrics |
| `mc_verdict`, `wfa_verdict` | Monte Carlo and Walk-Forward verdicts |
| `equity_curve` | Monthly normalized equity curve (strategy + all benchmarks), starting at 100 |
| `annual_returns` | Year-by-year strategy vs benchmark returns |
| `curve_smoothness` | Equity curve quality analysis (see below) |
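
A downstream script could filter these entries along the following lines. This is a sketch under stated assumptions: the field names come from the table above, but the top-level layout of the file is not documented here, so `load_entries` simply assumes a JSON list of strategy entries, and both function names and the Sharpe floor are hypothetical.

```python
import json
from pathlib import Path


def load_entries(path):
    """Load strategy entries. ASSUMES the file holds a JSON list of entries."""
    return json.loads(Path(path).read_text())


def shortlist(entries, min_sharpe=1.0):
    """Keep strategies that beat SPY and clear a Sharpe floor, best first."""
    kept = [
        entry for entry in entries
        if entry.get("beats_spy") and (entry.get("sharpe_ratio") or 0.0) >= min_sharpe
    ]
    return sorted(kept, key=lambda entry: entry["sharpe_ratio"], reverse=True)
```

If the real file wraps the entries in an outer object, only `load_entries` needs to change; `shortlist` works on any list of dictionaries carrying `beats_spy` and `sharpe_ratio`.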

Curve smoothness

The curve_smoothness block scores the equity curve on five criteria:

| Criterion | Threshold |
|---|---|
| R² of log-equity vs linear trend | ≥ 0.90 |
| Positive months | ≥ 60% |
| Longest plateau (months without new high) | < 12 months |
| Upthrust months (monthly return > mean + 3σ) | ≤ 2 |
| Worst single-month drawdown | > −10% |

Verdict: SMOOTH (0 failures) · ACCEPTABLE (1 failure) · ROUGH (2+ failures)

The smooth_notes array explains every failure in plain English.
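
The five checks can be approximated on any monthly equity series with the sketch below. It is not the project's implementation: `curve_smoothness` and the check names are hypothetical, σ is taken as the population standard deviation, and "worst single-month drawdown" is read as the worst one-month return. The thresholds and verdict labels follow the table above.

```python
import numpy as np


def curve_smoothness(monthly_equity):
    """Score a monthly equity curve against the five thresholds above."""
    equity = np.asarray(monthly_equity, dtype=float)
    returns = equity[1:] / equity[:-1] - 1.0

    # R² of log-equity against a straight line (squared correlation with time).
    months = np.arange(len(equity))
    r_squared = np.corrcoef(months, np.log(equity))[0, 1] ** 2

    # Longest run of consecutive months that fail to set a new equity high.
    longest = current = 0
    peak = equity[0]
    for value in equity[1:]:
        if value > peak:
            peak, current = value, 0
        else:
            current += 1
            longest = max(longest, current)

    # Months whose return exceeds mean + 3 sigma (population sigma assumed).
    upthrusts = int(np.sum(returns > returns.mean() + 3 * returns.std()))

    checks = {
        "r_squared": r_squared >= 0.90,
        "positive_months": np.mean(returns > 0) >= 0.60,
        "longest_plateau": longest < 12,
        "upthrust_months": upthrusts <= 2,
        "worst_month": returns.min() > -0.10,
    }
    failures = [name for name, passed in checks.items() if not passed]
    verdict = "SMOOTH" if not failures else "ACCEPTABLE" if len(failures) == 1 else "ROUGH"
    return verdict, failures
```

Returning the list of failed checks alongside the verdict mirrors the role of `smooth_notes`: the label alone does not say which criterion caused it.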

Terminal output

The terminal prints a single summary line per run:

LLM verdict written to 'output/runs/2026-04-30_10-00-40/llm_verdict.json' (1/3 beat SPY)

The full structured data is in the JSON file — it is not printed to the terminal.

CLI Flags

Every setting in config.py can be overridden at runtime — no file editing required.

Built-in flags

| Flag | Description |
|---|---|
| (none) | Full backtest run |
| `--init` | Launch the first-time setup wizard |
| `--dry-run` | Validate config and print run summary without fetching data |
| `--name <label>` | Prefix the output folder with a custom label |
| `--verbose` | Print Extended Metrics and Robustness tables |
| `--help-config [category]` | Print a guided tour of all config options with live defaults |

Config override flags

Pass any of these to override config.py for a single run:

Data

--provider <str>          norgate | yahoo | polygon | csv | parquet
--csv-dir <path>          CSV folder (--provider csv only)
--parquet-dir <path>      Parquet folder (--provider parquet only)

Period & Capital

--start <YYYY-MM-DD>      Backtest start date
--end   <YYYY-MM-DD>      Backtest end date
--capital <float>         Starting equity in USD

Portfolio & Symbols (mutually exclusive)

--symbols AAPL MSFT …     Inline ticker list → runs as 'CLI' portfolio
--portfolio nasdaq_100.json   JSON file or norgate:WatchlistName
--min-bars <int>          Skip symbols with fewer bars

Strategies

--strategies "Name 1" "Name 2"   Exact strategy names to run
--strategies all                 Run every registered strategy

Execution & Costs

--allocation <float>      Fraction of equity per position (e.g. 0.05)
--execution open|close    Fill time
--slippage <float>        Bid/ask slippage fraction
--commission <float>      Commission per share in USD
--risk-free-rate <float>  Annual risk-free rate for Sharpe
--htb-rate <float>        Annual hard-to-borrow rate for shorts
--max-pct-adv <float>     Max fraction of 20d ADV per order
--volume-impact <float>   Sqrt market-impact coefficient (0 = off)

Stop Loss (repeatable)

--stop none               No stop
--stop pct:0.05           5% percentage stop
--stop atr:14:3.0         ATR stop (period=14, multiplier=3.0)
--stop pct:0.05 atr:14:3.0   Run both variants in one pass
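
The stop specs above follow a simple `type:arg:arg` shape. As an illustration only (the engine's own parser is not shown here, and `parse_stop` and the dictionary keys are hypothetical), one way to decode a single value:

```python
def parse_stop(spec: str) -> dict:
    """Parse one --stop value such as 'none', 'pct:0.05', or 'atr:14:3.0'."""
    kind, *args = spec.split(":")
    if kind == "none" and not args:
        return {"type": "none"}
    if kind == "pct" and len(args) == 1:
        return {"type": "pct", "pct": float(args[0])}
    if kind == "atr" and len(args) == 2:
        return {"type": "atr", "period": int(args[0]), "multiplier": float(args[1])}
    raise ValueError(f"unrecognized stop spec: {spec!r}")


# Several values on one command line become several variants.
print([parse_stop(s) for s in ["none", "pct:0.05", "atr:14:3.0"]])
```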

Filtering

--min-pandl <float>       Min P&L % to show (-9999 = show all)
--max-dd <float>          Max drawdown to show (1.0 = show all)
--min-mc-score <float>    Min MC score to show
--min-vs-spy <float>      Min outperformance vs SPY

Monte Carlo

--mc-sims <int>           Number of MC simulations
--min-trades-mc <int>     Min trades required to run MC
--mc-sampling iid|block   Resampling method
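
The difference between the two resampling methods can be sketched as follows. This is a generic illustration, not the engine's code: `resample_trades`, `block_size`, and `seed` are hypothetical names, and the block variant shown is a circular block bootstrap, which may differ from what the engine does.

```python
import numpy as np


def resample_trades(trade_returns, n_sims=1000, method="iid", block_size=5, seed=0):
    """Return an (n_sims, n_trades) array of resampled trade returns."""
    trades = np.asarray(trade_returns, dtype=float)
    n = len(trades)
    rng = np.random.default_rng(seed)
    if method == "iid":
        # Draw every trade independently, with replacement.
        idx = rng.integers(0, n, size=(n_sims, n))
    elif method == "block":
        # Draw random start points and copy contiguous runs of trades,
        # wrapping around the end, so short-range ordering is preserved.
        n_blocks = -(-n // block_size)  # ceiling division
        starts = rng.integers(0, n, size=(n_sims, n_blocks, 1))
        idx = ((starts + np.arange(block_size)) % n).reshape(n_sims, -1)[:, :n]
    else:
        raise ValueError(f"unknown method: {method!r}")
    return trades[idx]


paths = resample_trades([0.01, -0.02, 0.03, 0.0, 0.015], n_sims=1000, method="block", block_size=2)
final_equity = np.prod(1.0 + paths, axis=1)  # compounded outcome of each simulated path
```

Independent draws destroy any streakiness in the trade sequence, while block draws keep runs of adjacent trades together, which matters when wins and losses tend to cluster.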

Walk-Forward Analysis

--wfa-split <float>       In-sample fraction (0 = disable WFA)
--wfa-folds <int>         Rolling WFA folds (0 = disable)
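
For intuition, an in-sample fraction divides the bar history into a fitting window and a held-out window. The sketch below covers only the single-split case with WFA enabled; `split_in_out` is a hypothetical helper, and how the engine builds rolling folds is not shown here.

```python
def split_in_out(bars, in_sample_fraction):
    """Split an ordered sequence of bars into in-sample and out-of-sample parts."""
    if not 0.0 < in_sample_fraction < 1.0:
        raise ValueError("in_sample_fraction must be strictly between 0 and 1")
    cut = int(len(bars) * in_sample_fraction)
    # Earlier bars are in-sample; later bars are held out.
    return bars[:cut], bars[cut:]


in_sample, out_of_sample = split_in_out(list(range(10)), 0.7)
print(len(in_sample), len(out_of_sample))  # 7 3
```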

Output & Misc

--save-trades / --no-save-trades
--save-filtered-only / --no-save-filtered-only
--noise <float>           OHLC noise injection fraction (0 = off)
--rolling-sharpe <int>    Rolling Sharpe window in bars (0 = off)
--export-ml / --no-export-ml
--upload-s3 / --no-upload-s3

Escape hatch — any config key not covered above

--set KEY=VALUE           Auto-cast to int/float/bool/str. Repeatable.
  e.g.  --set rolling_sharpe_window=252 --set htb_rate_annual=0.15
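
The auto-cast behavior can be pictured with the sketch below. It is an assumption about how such casting typically works rather than the project's parser: `parse_set` is a hypothetical name, and the order bool, int, float, str is a guess.

```python
def parse_set(arg: str):
    """Split 'KEY=VALUE' and cast VALUE to bool, int, float, or str."""
    key, sep, raw = arg.partition("=")
    if not sep or not key:
        raise ValueError(f"expected KEY=VALUE, got {arg!r}")
    lowered = raw.lower()
    if lowered in ("true", "false"):
        return key, lowered == "true"
    # Try the narrowest numeric type first, then fall back to a plain string.
    for cast in (int, float):
        try:
            return key, cast(raw)
        except ValueError:
            continue
    return key, raw


print(parse_set("rolling_sharpe_window=252"))  # ('rolling_sharpe_window', 252)
print(parse_set("htb_rate_annual=0.15"))       # ('htb_rate_annual', 0.15)
```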

Guided help

python main.py --help-config              # full reference with live defaults
python main.py --help-config data         # DATA section only
python main.py --help-config wfa          # WFA section only
# categories: data, period, portfolio, strategies, costs, stop,
#             filtering, mc, wfa, output

Example commands

# Quick single-ticker run on Yahoo Finance
python main.py --provider yahoo --symbols AAPL --start 2020-01-01 --capital 25000

# Scan Nasdaq 100, only show strategies that beat SPY
python main.py --portfolio nasdaq_100.json --min-vs-spy 0.0

# Test three stop-loss variants in one pass
python main.py --stop none pct:0.05 atr:14:3.0

# Stress test with noise + block-bootstrap MC
python main.py --noise 0.01 --mc-sampling block --mc-sims 2000

# Override keys not covered by a named flag
python main.py --set rolling_sharpe_window=252 --set htb_rate_annual=0.15

Contributing

See CONTRIBUTING.md for dev setup, how to add a strategy plugin, and the PR checklist.

License

MIT License
