transformerless_lm: continuous self-distillation + cycle checkpointing#5
Draft
RandomCoder-lab wants to merge 1 commit into
Draft
transformerless_lm: continuous self-distillation + cycle checkpointing#5RandomCoder-lab wants to merge 1 commit into
RandomCoder-lab wants to merge 1 commit into