Set scaling-study configs to exactly one epoch by ksd3 · Pull Request #71 · Smith42/astroPT

ksd3 · 2026-06-13T19:54:59Z

max_iters = 30000 in the config/pythia-like configs was an inherited nanoGPT placeholder, not derived from the dataset. At effective batch 640 that's 19.2M presentations = ~2.27 passes over the 8,474,566-galaxy Smith42/galaxies train split.

This sets max_iters = 13241 = floor(8,474,566 / 640) — one pass over the train set (~2.17B image-patch tokens, ≈ Chinchilla-optimal for the 100M run) — and shrinks lr_decay_iters to match so the cosine LR fully decays to min_lr by the end of the run rather than stopping mid-decay.

(Note: 13241 is one pass minus a 326-example remainder, since 8,474,566 isn't divisible by the effective batch; truly exact one-pass coverage under DDP needs len-aware sampling, which is a separate change.)

max_iters was an inherited placeholder (30000), which at effective batch 640 is 19.2M presentations = ~2.27 passes over the 8,474,566-galaxy train set. Set max_iters = 13241 = floor(8,474,566 / 640) so each run does exactly one epoch (~2.17B tokens, ~Chinchilla-optimal for the 100M model), and shrink lr_decay_iters to match so the cosine LR fully decays to min_lr by the end of the run rather than stopping mid-decay.

ksd3 mentioned this pull request Jun 13, 2026

Reproducible streaming data order across runs (fixed data_seed + pinned num_workers) #70

Open

ksd3 force-pushed the feat/one-epoch-budget branch 2 times, most recently from 8abd9ec to 8f8ad66 Compare June 13, 2026 21:50

ksd3 requested a review from Smith42 June 13, 2026 21:51

Smith42 mentioned this pull request Jun 14, 2026

Pythia configs: effective batch 320 + clean one-epoch stop #73

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Set scaling-study configs to exactly one epoch#71

Set scaling-study configs to exactly one epoch#71
ksd3 wants to merge 1 commit into
mainfrom
feat/one-epoch-budget

ksd3 commented Jun 13, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ksd3 commented Jun 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ksd3 commented Jun 13, 2026 •

edited

Loading