A chaotic, evolutionary, dependency-free GPT built from scratch.
AtomGPT is an educational and experimental project that implements a Generative Pre-trained Transformer (GPT) entirely in the Python standard library. No PyTorch, no NumPy, no TensorFlow. Just pure Python logic, from the autograd engine to the transformer blocks.
Beyond a simple implementation, AtomGPT introduces an Evolutionary Forge (forge.py), where models not only learn from data but also evolve their architecture over time—growing layers, adding heads, and pruning weights to survive.
- Zero Dependencies: Runs on pure Python. If you have Python 3, you can run AtomGPT.
- Custom Autograd: A transparent backpropagation engine (`Value`) built from the ground up (sketched below).
- Evolutionary Training: Models compete in a population. The fittest survive, clone, and mutate (add/remove layers, heads, etc.).
- Educational Core: `microgpt.py` contains the entire logic in a single file for easy study.
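To get a feel for the `Value` engine, here is a minimal sketch of a scalar autograd node. The actual class in `microgpt.py` supports more operations and may differ in details.

```python
class Value:
    """A scalar that records how it was computed, so gradients can
    flow backwards through the computation graph."""

    def __init__(self, data, _children=()):
        self.data = data
        self.grad = 0.0
        self._backward = lambda: None  # how to push grad to parents
        self._prev = set(_children)

    def __add__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data + other.data, (self, other))
        def _backward():
            self.grad += out.grad
            other.grad += out.grad
        out._backward = _backward
        return out

    def __mul__(self, other):
        other = other if isinstance(other, Value) else Value(other)
        out = Value(self.data * other.data, (self, other))
        def _backward():
            self.grad += other.data * out.grad
            other.grad += self.data * out.grad
        out._backward = _backward
        return out

    def backward(self):
        # Topologically sort the graph, then apply the chain rule in reverse.
        topo, visited = [], set()
        def build(v):
            if v not in visited:
                visited.add(v)
                for child in v._prev:
                    build(child)
                topo.append(v)
        build(self)
        self.grad = 1.0
        for node in reversed(topo):
            node._backward()

# Example: d(a*b + a)/da = b + 1 = 4
a, b = Value(2.0), Value(3.0)
loss = a * b + a
loss.backward()
print(a.grad)  # 4.0
```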
```bash
git clone https://github.com/pronzzz/atomgpt.git
cd atomgpt
```

Optional: Install graphviz if you want to visualize the computation graph (used in `atomgpt/visualizer.py`), but it is not required for the core model.

```bash
pip install graphviz
```

Watch models evolve and generate fantasy names in real time:

```bash
python3 forge.py
```

This script will:
- Initialize a population of small GPT models.
- Train them on a dataset of fantasy names.
- Evolve the population (Select -> Clone -> Mutate).
- Generate new names periodically.
- Save the best names to `generated_names.txt`.
If you want to study the bare-metal implementation:
```bash
python3 microgpt.py
```

This script downloads a dataset (if missing), trains a model, and prints generated samples to the console.
Here is a step-by-step walkthrough of what happens when you run AtomGPT:
When `forge.py` starts, it initializes a `Population` of random GPT models. Each model starts small (e.g., 1 layer, 16-dimensional embeddings) to survive the harsh environment of random initialization.
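In sketch form, the setup looks roughly like this. The class, config fields, and population size here are illustrative assumptions, not forge.py's actual API:

```python
import random

class GPT:
    """Stub standing in for the project's GPT model."""
    def __init__(self, n_layer, n_head, n_embd):
        self.n_layer, self.n_head, self.n_embd = n_layer, n_head, n_embd

POPULATION_SIZE = 8  # assumed; forge.py picks its own size

def random_config():
    # Every model starts tiny so it can survive random initialization.
    return {"n_layer": 1, "n_head": random.choice([1, 2]), "n_embd": 16}

population = [GPT(**random_config()) for _ in range(POPULATION_SIZE)]
```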
In every step, the models are fed a name (e.g., "Drakon"); a condensed version of this loop is sketched after the list.
- Tokenization: "Drakon" is broken down into characters.
- Forward Pass: The characters flow through the user-defined `GPT` architecture:
  - Embeddings lookup.
  - Attention mechanisms weigh relationships between characters.
  - MLPs process the information.
- Loss Calculation: The model predicts the next character. We calculate the negative log-likelihood loss.
- Backward Pass: The custom autograd engine traces the graph backwards, calculating gradients for every weight.
- Update: An Adam-inspired optimizer tweaks the weights to reduce error.
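Condensed into code, one step looks roughly like this. The forward pass is stubbed out with random probabilities, since the real one (embeddings, attention, MLPs) lives in `microgpt.py`:

```python
import math, random

name = "Drakon"
vocab = sorted(set(name))  # toy vocabulary; the real one covers the dataset
stoi = {ch: i for i, ch in enumerate(vocab)}

# Tokenization: break the name into character ids.
tokens = [stoi[ch] for ch in name]

def forward(context):
    """Stub for the GPT forward pass: returns a probability
    distribution over the next character."""
    weights = [random.random() for _ in vocab]
    total = sum(weights)
    return [w / total for w in weights]

# Loss: mean negative log-likelihood of each next character.
loss = 0.0
for i in range(len(tokens) - 1):
    probs = forward(tokens[: i + 1])
    loss += -math.log(probs[tokens[i + 1]])
loss /= len(tokens) - 1
print(f"NLL loss: {loss:.3f}")

# In AtomGPT, loss is a Value node: loss.backward() fills .grad on every
# weight, and an Adam-style update then nudges each weight downhill.
```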
After a set number of steps (a generation), the forge pauses to judge the models; the selection loop is sketched after this list.
- Evaluation: Models are scored based on their Loss (how well they predict) and Efficiency (parameter count).
- Culling: The bottom 50% of models are deleted.
- Reproduction: The survivors are cloned to refill the population.
- Mutation: The clones undergo random mutations:
  - Growth: "I need more power!" -> Adds a layer or attention head.
  - Efficiency: "I am too heavy." -> Prunes small weights or shrinks the embedding dimension.
  - Chaos: Randomly sparsifies a dense layer.
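One generation of the forge, in sketch form. The fitness weighting and mutation methods below are assumptions for illustration, not forge.py's exact code:

```python
import copy, random

class Model:
    """Stub standing in for an evolved GPT; attribute and method
    names are illustrative, not forge.py's actual API."""
    def __init__(self):
        self.loss = random.uniform(2.0, 4.0)
        self.param_count = random.randint(1_000, 10_000)
    def grow(self):     self.param_count += 500  # add a layer/head
    def prune(self):    self.param_count -= 200  # drop small weights
    def sparsify(self): self.param_count -= 100  # zero out a dense layer

def fitness(m):
    # Lower is better: blends prediction quality with efficiency.
    # The exact weighting is an assumption, not forge.py's formula.
    return m.loss + 1e-5 * m.param_count

def evolve(population):
    """One generation: evaluate -> cull -> clone -> mutate."""
    survivors = sorted(population, key=fitness)[: len(population) // 2]
    clones = [copy.deepcopy(random.choice(survivors)) for _ in survivors]
    for clone in clones:
        random.choice([clone.grow, clone.prune, clone.sparsify])()
    return survivors + clones

population = [Model() for _ in range(8)]
population = evolve(population)
```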
Finally, the champion model is used to hallucinate new names. It samples character by character, following the statistical patterns it learned (and evolved to process efficiently).
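Generation reduces to repeatedly converting next-character logits into a distribution and drawing from it. A minimal sketch, assuming a standard softmax with temperature:

```python
import math, random

def sample_index(logits, temperature=1.0):
    """Softmax the logits (with temperature), then draw an index."""
    scaled = [l / temperature for l in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    r, acc = random.random(), 0.0
    for i, e in enumerate(exps):
        acc += e / total
        if r <= acc:
            return i
    return len(exps) - 1

vocab = ["a", "k", "n", "r"]  # toy vocabulary
print(vocab[sample_index([0.1, 2.0, -1.0, 0.5])])
```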
- Implement more complex mutation operators (e.g., skip connection rewiring).
- Add saving/loading of model "species" (checkpoints).
- Visualize the evolutionary tree.
MIT License. See LICENSE for details.