Bumblebee is a small Transformer.
- Clone repo.
- cp setenv.sh local_setenv.sh
  Configure local_setenv.sh as detailed in that file.
- . local_setenv.sh
- python train.py
The first time you run train.py, it will download and preprocess a very large Hugging Face dataset into $HF_HOME. This may take hours.
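Since the first run writes a large dataset into $HF_HOME, it can help to confirm where that variable points and how much disk space is free before starting. The following is a minimal sketch, not part of this repo: the helper names resolve_hf_home and free_gib are hypothetical, and the fallback path assumes the usual Hugging Face default of $XDG_CACHE_HOME/huggingface (or ~/.cache/huggingface) when HF_HOME is unset.

```python
import os
import shutil
from pathlib import Path


def resolve_hf_home(env=os.environ):
    """Return the directory the Hugging Face cache is expected to live in.

    Hypothetical helper: HF_HOME wins if set; otherwise assume the usual
    default of $XDG_CACHE_HOME/huggingface, or ~/.cache/huggingface.
    """
    if env.get("HF_HOME"):
        return Path(env["HF_HOME"]).expanduser()
    cache_root = env.get("XDG_CACHE_HOME") or Path.home() / ".cache"
    return Path(cache_root) / "huggingface"


def free_gib(path):
    """Free space, in GiB, on the filesystem that holds `path`."""
    # The cache directory may not exist yet, so walk up to the nearest
    # ancestor that does and measure that filesystem instead.
    probe = Path(path)
    while not probe.exists():
        probe = probe.parent
    return shutil.disk_usage(probe).free / 2**30


if __name__ == "__main__":
    hf_home = resolve_hf_home()
    print(f"HF_HOME resolves to {hf_home} ({free_gib(hf_home):.1f} GiB free)")
```

Run it after sourcing local_setenv.sh, so that it sees the same environment train.py will.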
See interactive-dp8.sh and batch-dp32.sh.
Testing: currently there is no automated testing. I manually test, from time to time, by running python train.py and declaring success when the loss looks like it's going down.
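The manual check above could be made a little less subjective by comparing the average loss early in a run with the average loss at the end. The following is only a sketch under assumptions: loss_is_decreasing is a hypothetical helper, not something train.py provides, and it assumes you have already collected the logged losses into a list of positive floats (as with cross-entropy).

```python
from statistics import fmean


def loss_is_decreasing(losses, window=50, min_rel_drop=0.01):
    """Return True if the mean of the last `window` losses is at least
    `min_rel_drop` (as a fraction) below the mean of the first `window`.

    Hypothetical helper; assumes positive loss values.
    """
    if len(losses) < 2 * window:
        raise ValueError("need at least 2 * window loss values to compare")
    early = fmean(losses[:window])   # average loss at the start of the run
    late = fmean(losses[-window:])   # average loss at the end of the run
    return late < early * (1.0 - min_rel_drop)


if __name__ == "__main__":
    # Synthetic example: a loss curve that falls steadily from 10.0.
    fake_losses = [10.0 - 0.01 * step for step in range(200)]
    print(loss_is_decreasing(fake_losses))
```

Averaging over a window rather than comparing single steps keeps per-batch noise from deciding the result; the window size and threshold here are arbitrary choices and would need tuning for a real run.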
I use a pre-commit Git hook that calls ruff-check. To install this hook on your clone, run the following from the top-level directory:
# Install pre-commit and ruff, if you don't have them already:
pip install pre-commit ruff
# Install the hook, specified in .pre-commit-config.yaml, to .git/hooks/pre-commit
pre-commit install
# Ensure that it installed correctly.
pre-commit run --all-files
