Skip to content

External evidence: submit one deterministic baseline row #46

@weich97

Description

@weich97

Goal

Submit one schema-valid deterministic baseline row so the leaderboard has more non-LLM evidence anchors instead of only maintainer-generated examples.

Suggested scope

Pick one simple deterministic policy and one documented scenario. Good options are always-hold, random, naive momentum, mean reversion, risk parity, or another classical baseline that does not call a live model provider.

Commands

python -m pip install -e ".[dev]"
python scripts/run_classical_baseline_matrix.py
tradearena validate-submission <path-to-your-row.json>
tradearena build-registry examples/benchmark_submissions \
  --output docs/results/community_registry.md \
  --csv-output docs/results/community_registry.csv \
  --html-output docs/results/community_registry.html

Acceptance criteria

Please submit either an issue report or a PR that includes:

  • deterministic policy name and scenario;
  • manifest path;
  • validation command output;
  • reproducibility hash;
  • registry diff or generated registry artifact path;
  • evidence labels, especially deterministic-baseline, fully-auditable, and whether the row is external-submitted.

Keep the row free of provider prompts, API keys, broker credentials, private holdings, and live-order instructions.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions