-
Notifications
You must be signed in to change notification settings - Fork 18
Fix for incorrect sequence of prompts being sent over for inference #147
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
ethantang-db
merged 110 commits into
single-controller-hackathon
from
ethantang-db/fix_multi_node
Aug 14, 2025
Merged
Changes from all commits
Commits
Show all changes
110 commits
Select commit
Hold shift + click to select a range
15c6ae3
adding logging to understand weight updates
ethantang-db 4ed2322
assert false
ethantang-db 453a610
logging more updateS
ethantang-db 5c15cb8
trying out llama 1b
ethantang-db 2bd7bc9
fix loading
ethantang-db a8d817b
different dataset
ethantang-db 5d64218
revert to r1
ethantang-db 3cf4432
trying out 2 nodes
ethantang-db f7184be
test
ethantang-db 7e66c92
log worker_wrap logic
ethantang-db 7e1d365
force crash
ethantang-db 7c10670
removing assert
ethantang-db b0a3467
jank logging
ethantang-db ccd2e4c
try gloo?
ethantang-db 34ce4fb
revert back to nccl
ethantang-db 536e513
try out cpu and gloo
ethantang-db 2f4de01
log tensors to file
ethantang-db c96a8d1
better logging
ethantang-db ac6b8d9
removed redundent debugging
ethantang-db 24b0d93
rank
ethantang-db f95a801
try env vars
ethantang-db b0a3441
try out other place for nccl
ethantang-db 5598942
f...
ethantang-db febdc69
further debugging
ethantang-db 05085f3
log what weights are updated
ethantang-db 0ebb94b
log weight updates
ethantang-db 2673c66
update weights
ethantang-db d3c1d20
this is trippin
ethantang-db 840e0d8
better weight logging
ethantang-db b9dfda7
like cursor bruh?
ethantang-db 50475d5
better logs
ethantang-db c80e25f
???
ethantang-db 2c8d11f
???
ethantang-db 1de2fff
try layer 25
ethantang-db 8089c52
cranking up the learning rate
ethantang-db f38d7db
new lines
ethantang-db 963a66f
remove new lines
ethantang-db 37403ef
even more ridiculous LR
ethantang-db 6fbdfc3
are you fking kidding me
ethantang-db 7c699d6
new opt
ethantang-db 5672519
Merge branch 'single-controller-hackathon' into ethantang-db/fix_mult…
ethantang-db 81b9c46
chaos updates
ethantang-db 00920bf
try new chaos values
ethantang-db a1c3594
logging weight updateS
ethantang-db 5949039
lr is 1
ethantang-db abbfa1d
god dammit cursor
ethantang-db fca0c37
correct rank
ethantang-db 85cfd6b
try other chaos update vlaues
ethantang-db ba7fb54
fix crash?
ethantang-db be9d152
remove chaos
ethantang-db 513ff59
fp32 everything
ethantang-db 6a81fd1
enable chaos
ethantang-db 2822d71
god dammit flash attention
ethantang-db 801099c
disable flash attention on vllm
ethantang-db 8015476
try out torch SDPA
ethantang-db fe7f76c
:/
ethantang-db a48416f
proper fsdp summon
ethantang-db 21fafa1
bfloat16
ethantang-db 35721b8
disable chaos
ethantang-db 7cb3c3b
better logging
ethantang-db f926283
logging how many examples being trained
ethantang-db b133fb6
more logging
ethantang-db 816a527
log loss
ethantang-db 5cbb05a
debugging batch better
ethantang-db c301055
shorter for better debugging
ethantang-db b5fc1f0
more len
ethantang-db aefd737
test
ethantang-db fd47fc0
test
ethantang-db 2a45675
not cpu
ethantang-db 09a0c11
gpu
ethantang-db 7872488
debug more
ethantang-db 3484cf8
disable logging
ethantang-db d3b16be
change gen len
ethantang-db 6cd3980
disable logging
ethantang-db 43e3df9
logging before
ethantang-db 96c98fe
increase global batch size
ethantang-db 33766c8
log rewards
ethantang-db 930ed2d
more logs
ethantang-db 35024c8
probe more
ethantang-db d634b86
log expanded advantages better
ethantang-db c0e95e1
more log
ethantang-db 05c90d2
log ids
ethantang-db 4672849
wtf?
ethantang-db 26ff5a9
???
ethantang-db 92f1090
inverse indices
ethantang-db 171fcf9
test wild theory
ethantang-db cdce7f0
validate 32 bs also
ethantang-db 594f87e
isolate which var it is
ethantang-db dde0142
trying out 16 samples
ethantang-db 26b6cc4
more debug
ethantang-db 2e1b64a
trying out something else
ethantang-db 7042619
further debugging
ethantang-db a768a0e
debug
ethantang-db 136d687
double checking type
ethantang-db d46ae8d
more check
ethantang-db 2289ee7
flipping this
ethantang-db 46e72b6
more corrections
ethantang-db 27a52f2
more debugging
ethantang-db a394e79
stack
ethantang-db bdeb027
fix bs
ethantang-db 2647386
more fix
ethantang-db c0c3dae
most random comma
ethantang-db 45b08fe
double checking shapes
ethantang-db f8bfcce
should be stack
ethantang-db 4075679
cleaning up stuff
ethantang-db 7b62267
let's do a run
ethantang-db 748168e
change cluster
ethantang-db 06248b7
the fix
ethantang-db d1bf233
white space
ethantang-db 9af0e4d
added checks for proper configs
ethantang-db File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.