Skip to content

Add jetformer logic to main AstroPT branch#55

Draft
Smith42 wants to merge 11 commits into
mainfrom
sogol_branch
Draft

Add jetformer logic to main AstroPT branch#55
Smith42 wants to merge 11 commits into
mainfrom
sogol_branch

Conversation

@Smith42

@Smith42 Smith42 commented Nov 11, 2025

Copy link
Copy Markdown
Owner

We will need to add the Jetformer logic to the main AstroPT branch next. Lots of merging 🥲

Michael Smith and others added 2 commits October 14, 2025 10:49
…ain (#53)

* Use tokeniser in train.py

* Add NormalizingFlow example on CIFAR10

* Save mike galaxy photos as tensor, Add galaxy 32x32 dataset to jetformer

* Add valid loss to jetformer train, Generate sample that can be compared

* Larger GPT model and results

* Jet run with 256x256 image

* Remove unrelated comments

* remove imagery and text of legacy runs

---------

Co-authored-by: Mike Smith <mike@mjjsmith.com>
Smith42 and others added 9 commits November 11, 2025 13:35
Integrate jetformer into astropt
…ng scripts with batch size adjustments and epoch support, add job submission files
train_jetformer_sogol.py is the version used to produce the
sogol_checkpoint_step_0079999 paper checkpoint (768-dim, 12-layer,
80k iters, HuggingFace streaming, paper-style noise curriculum).

.gitignore additions exclude local caches, large data dumps, training
logs, and paper-artifact files from future commits.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants