Latency-Aware Execution Engine for Portfolio Rebalancing

A production-quality trade execution system that minimizes slippage on large cryptocurrency orders using reinforcement learning. Trained on 7.5M bars of real Binance market data across 3 assets, evaluated walk-forward on fully out-of-sample 2024 data, and validated against 98M real tick-level trades.

CS5130 - Programming and Data Processing for AI | Spring 2026 | Northeastern University

Results

Strategy Comparison (50 BTC, 500 simulations)

Strategy	Cost ($)	vs TWAP
Immediate	$370.29	-1,734%
TWAP	$20.20	baseline
VWAP	$15.09	+25.3%
A-C (λ=0.5)	$20.20	+0.0%
ML Agent (ours)	$11.29	+44.1%

Multi-Asset Out-of-Sample (50 BTC, 2024 H2)

Asset	vs TWAP	Win Rate	vs VWAP
BTCUSDT	+10.2 bps	75%	-0.6 bps
ETHUSDT	+0.6 bps	65%	-0.8 bps
SOLUSDT	-0.2 bps	12%	-0.03 bps
Average	+3.5 bps		-0.5 bps

At 50 BTC ($2.5M), saving 10.2 bps on BTCUSDT = **$940 per order**. Over 1,000 orders = $940,000 saved.

Problem

When a hedge fund needs to buy 50 BTC, executing it all at once moves the price against them - this is called slippage or market impact. The cost increases nonlinearly with order size: each additional BTC costs more than the last because you're consuming progressively deeper levels of the order book.

The question: How do you slice a large order over 60 minutes to minimize total execution cost?

This project builds a complete system to answer that question using real market data and reinforcement learning.

Quick Start

Setup

git clone https://github.com/YOUR_USERNAME/Latency-Aware-Execution-Engine.git
cd Latency-Aware-Execution-Engine
python -m venv venv && source venv/bin/activate
pip install pandas numpy pyarrow pyyaml torch tqdm requests matplotlib seaborn

Download Data

python scripts/download_data.py --symbols BTCUSDT ETHUSDT SOLUSDT --start 2020-01-01 --end 2024-12-31
python scripts/validate_data.py --data data/raw/klines/BTCUSDT/ --output data/processed/BTCUSDT_klines_1m.parquet

Run Full Pipeline (single command)

python scripts/run_pipeline.py --data data/processed/BTCUSDT_klines_1m.parquet
python scripts/run_pipeline.py --data data/processed/BTCUSDT_klines_1m.parquet --full  # includes failure/ethics analysis

Train RL Agent

# Single asset, 50 BTC orders (~3 hours on CPU)
python scripts/train_large.py --data data/processed/BTCUSDT_klines_1m.parquet --qty 50 --episodes 50000

# Multi-asset training on BTC + ETH + SOL (~5 hours)
python scripts/train_multi.py --train --episodes 50000 --qty 50

# Order size sweep (10, 25, 50 BTC)
python scripts/train_large.py --data data/processed/BTCUSDT_klines_1m.parquet --sweep

Evaluate on Tick Data

python scripts/eval_ticks.py --tick-dir data/raw/trades/BTCUSDT --model models/multi/best.pt --qty 50

Generate Figures

python scripts/generate_figures.py --data data/processed/BTCUSDT_klines_1m.parquet --model models/multi/best.pt

Run Tests

# Run the full test suite (176 tests)
python -m pytest tests/ -v

# Run with coverage report
python -m pytest tests/ --cov=src --cov-report=term-missing

# Run a specific module
python -m pytest tests/test_features/ -v
python -m pytest tests/test_simulator/ -v
python -m pytest tests/test_policies/ -v
python -m pytest tests/test_data/ -v
python -m pytest tests/test_evaluation/ -v
python -m pytest tests/test_utils/ -v

System Architecture

Raw Market Data (Binance OHLCV + Tick Trades)
        │
        ▼
┌──────────────────┐
│  Data Ingestion  │  download_data.py, validate_data.py
│  & Validation    │  → Schema validation, 7 quality checks
│                  │  → 2.6M bars per asset, 126 GB tick data
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Feature         │  src/features/engine.py
│  Engineering     │  → 14 features: volatility, volume, spread,
│                  │    momentum, time-of-day encoding
└────────┬─────────┘
         │
         ▼
┌──────────────────┐
│  Execution       │  src/simulator/impact.py + engine.py
│  Simulator       │  → Almgren-Chriss impact model
│                  │  → Variable spread from real data
│                  │  → Cost = spread + η × price × part^1.5
└────────┬─────────┘
         │
    ┌────┴────┐
    │         │
    ▼         ▼
┌────────┐ ┌─────────────┐
│Baselines││ RL Agent    │
│ TWAP   │ │ Double DQN  │  14-dim state, 7 VWAP-relative actions
│ VWAP   │ │ + Dueling   │  Terminal reward: savings vs VWAP in bps
│ A-C    │ │ + PER       │  50K episodes, 3 assets
│ Immed. │ │ + LayerNorm │
└───┬────┘ └──────┬──────┘
    │              │
    ▼              ▼
┌──────────────────┐
│  Evaluation      │  Monte Carlo (500+ sims per strategy)
│                  │  Walk-forward: Train 2020-23 → Test 2024 H2
│ → Regime analysis│  Cross-asset generalization (BTC, ETH, SOL)
│  → Tick-level    │  98M real trade validation
│  → Failure cases │  Edge cases, limitations, ethics
└──────────────────┘

Repository Structure

Latency-Aware-Execution-Engine/
├── src/                            # Source code (Python package)
│   ├── __init__.py                 # v1.0.0
│   ├── data/
│   │   ├── schemas.py              # KlineSchema, TradeSchema
│   │   ├── loader.py               # Binance CSV/Parquet loader
│   │   └── validator.py            # 7 data quality checks
│   ├── features/
│   │   └── engine.py               # 14 execution-relevant features
│   ├── simulator/
│   │   ├── impact.py               # Almgren-Chriss market impact model
│   │   └── engine.py               # Policy-agnostic execution simulator
│   ├── policies/
│   │   ├── baselines.py            # Immediate, TWAP, VWAP, A-C
│   │   ├── rl_env.py               # Gym-style RL environment
│   │   └── dqn_agent.py            # Double DQN + PER + dueling
│   ├── evaluation/
│   │   ├── backtest.py             # Monte Carlo backtester
│   │   └── visualizations.py       # Plot utilities
│   └── utils/
│       ├── config.py               # YAML config loader
│       └── errors.py               # PipelineError, safe_execute, validate helpers
├── scripts/                        # CLI tools
│   ├── download_data.py            # Data download with retry
│   ├── validate_data.py            # Quality checks + Parquet export
│   ├── run_backtest.py             # Baseline strategy comparison
│   ├── run_pipeline.py             # Single-command end-to-end pipeline
│   ├── train_large.py              # DQN training (large orders, sweep)
│   ├── train_multi.py              # Multi-asset training (BTC+ETH+SOL)
│   ├── eval_ticks.py               # Tick-level evaluation (98M trades)
│   ├── analysis.py                 # Failure/edge/ethics analysis
│   ├── generate_figures.py         # Publication-quality plots
│   └── parse_log.py                # Training log → CSV
├── tests/                          # 176 tests
│   ├── test_data/
│   │   └── test_loader_and_validator.py  # schemas, CSV loader, validator
│   ├── test_features/
│   │   └── test_engine.py               # all feature functions, edge cases
│   ├── test_simulator/
│   │   └── test_engine_and_impact.py    # impact model, A-C trajectory, simulator
│   ├── test_policies/
│   │   └── test_all_policies.py         # baselines, adaptive, RL env, DQN agent
│   ├── test_evaluation/
│   │   └── test_backtest.py             # Monte Carlo backtester, regime analysis
│   └── test_utils/
│       └── test_errors_and_config.py    # PipelineError, safe_execute, config loader
├── configs/default.yaml            # Centralized hyperparameters
├── notebooks/                      # Colab training notebooks
├── models/                         # Saved checkpoints (best.pt, final.pt)
├── reports/figures/                 # Generated plots (4 figures)
├── data/                           # Market data (gitignored)
├── .github/
│   └── workflows/
│       └── tests.yml               # CI: runs test suite on push/PR to main
└── .gitignore

Technical Details

Data

Asset	Bars	Date Range	Price Range	Median Volume
BTCUSDT	2,628,555	2020-01 → 2024-12	$3,810 → $108,258	34.6 BTC/min
ETHUSDT	2,628,554	2020-01 → 2024-12	$86 → $4,865	254.6 ETH/min
SOLUSDT	2,307,975	2020-04 → 2024-12	$1 → $264	1,407 SOL/min
Tick trades	98,667,010	2023-03	Individual trades	Trade-by-trade

Impact Model

cost = quantity × (spread/2 + η × price × participation^1.5)

Spread: variable from real data (median 2 bps, range 0.5–10 bps)
η = 0.3 (temporary impact coefficient)
Participation cap: 15% of bar volume

RL Agent

Component	Choice
Algorithm	Double DQN with dueling architecture
Replay	Prioritized experience replay (PER, α=0.6)
Network	256-256-128 with LayerNorm
Optimizer	AdamW (lr=3e-4, cosine annealing)
Actions	7 VWAP-relative: [0, 0.3, 0.6, 1.0, 1.5, 2.0, 3.0]
State	14 features (inventory, time, vol, volume, spread, momentum, time-of-day)
Reward	Terminal: savings vs VWAP in bps
Training	50K episodes across BTC+ETH+SOL (2020-2023)

Evaluation Protocol

Split	Data	Purpose
Train	BTC+ETH+SOL 2020-2023	6M bars, agent learns
Validate	2024 Jan-Jun	Early stopping, model selection
Test	2024 Jul-Dec	Final reported results (never seen)

Failure Cases & Limitations

Failure cases (from scripts/analysis.py):

Flash crashes: +318 bps IS during COVID (2020-03-20)
Low volume: 500 BTC orders achieve only 49% fill rate
Short horizons: 1-bar execution costs 20× more than 60-bar

Limitations: No real order book data, simulated (not real) fills, single-asset execution, no multi-venue routing, discrete actions, 1-minute resolution.

Ethical risks: Market manipulation (mitigated by participation caps), fairness concerns, over-reliance on automation (mitigated by kill switches), data privacy (public data only).

Full analysis: python scripts/analysis.py --data data/processed/BTCUSDT_klines_1m.parquet

References

Almgren & Chriss (2000). Optimal execution of portfolio transactions.
Bertsimas & Lo (1998). Optimal control of execution costs.
Nevmyvaka et al. (2006). Reinforcement learning for optimized trade execution. ICML.
Ning et al. (2021). Double deep Q-learning for optimal execution.
van Hasselt et al. (2016). Deep RL with double Q-learning. AAAI.
Wang et al. (2016). Dueling network architectures. ICML.
Schaul et al. (2016). Prioritized experience replay. ICLR.

Author

Nikhilesh Waghmare - MS in Artificial Intelligence, Northeastern University (Expected May 2027)
Aniket Ghosh - MS in Artificial Intelligence, Northeastern University (Expected May 2027)
Frank Duah - MS in Artificial Intelligence, Northeastern University (Expected May 2027)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Latency-Aware Execution Engine for Portfolio Rebalancing

Results

Strategy Comparison (50 BTC, 500 simulations)

Multi-Asset Out-of-Sample (50 BTC, 2024 H2)

Problem

Quick Start

Setup

Download Data

Run Full Pipeline (single command)

Train RL Agent

Evaluate on Tick Data

Generate Figures

Run Tests

System Architecture

Repository Structure

Technical Details

Data

Impact Model

RL Agent

Evaluation Protocol

Failure Cases & Limitations

References

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 46 Commits
.github/workflows		.github/workflows
configs		configs
data		data
notebooks		notebooks
reports		reports
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
README.md		README.md
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.sh		setup.sh

Folders and files

Latest commit

History

Repository files navigation

Latency-Aware Execution Engine for Portfolio Rebalancing

Results

Strategy Comparison (50 BTC, 500 simulations)

Multi-Asset Out-of-Sample (50 BTC, 2024 H2)

Problem

Quick Start

Setup

Download Data

Run Full Pipeline (single command)

Train RL Agent

Evaluate on Tick Data

Generate Figures

Run Tests

System Architecture

Repository Structure

Technical Details

Data

Impact Model

RL Agent

Evaluation Protocol

Failure Cases & Limitations

References

Author

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages