RainMan is a Large Language Model from Scratch implementation project. It aims to build and understand LLMs from the ground up, starting with the core foundations and progressively adding modern architectural features.
src/v1: Foundations & Core Architecture. Contains the implementation of a standard Transformer-based language model, including tokenization, attention mechanisms, and training loops from scratch.
To get started with the v1 implementation:
cd src/v1