`19k gradient steps with bridge (~12 epochs) and 30k with fractal (~8 epochs` I wonder train on a mixture ? or train bridge dataset , then train on fractal?
19k gradient steps with bridge (~12 epochs) and 30k with fractal (~8 epochsI wonder train on a mixture ? or train bridge dataset , then train on fractal?