This project is a minimal GPT-style language model built entirely from scratch in PyTorch. It follows the core architecture of the Transformer (multi-head self-attention + feed-forward blocks + residual connections + layer norm), trained on a text dataset to generate new sequences.
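The block structure described above (pre-layer-norm self-attention and feed-forward sublayers, each wrapped in a residual connection) can be sketched roughly like this. This is an illustrative sketch using PyTorch's built-in `nn.MultiheadAttention`; the class and layer names in `bigram.py` may differ:

```python
import torch
import torch.nn as nn

class Block(nn.Module):
    """One transformer block: causal self-attention + feed-forward,
    each with a residual connection and pre-layer-norm."""
    def __init__(self, n_embed, n_head, dropout=0.0):
        super().__init__()
        self.ln1 = nn.LayerNorm(n_embed)
        self.attn = nn.MultiheadAttention(
            n_embed, n_head, dropout=dropout, batch_first=True
        )
        self.ln2 = nn.LayerNorm(n_embed)
        self.ff = nn.Sequential(
            nn.Linear(n_embed, 4 * n_embed),
            nn.ReLU(),
            nn.Linear(4 * n_embed, n_embed),
            nn.Dropout(dropout),
        )

    def forward(self, x):
        # Causal mask: each position may attend only to itself and earlier positions
        T = x.size(1)
        mask = torch.triu(torch.ones(T, T, dtype=torch.bool), diagonal=1)
        a = self.ln1(x)
        attn_out, _ = self.attn(a, a, a, attn_mask=mask)
        x = x + attn_out            # residual around attention
        x = x + self.ff(self.ln2(x))  # residual around feed-forward
        return x
```

Stacking `n_layer` of these blocks on top of token and position embeddings, followed by a final layer norm and a linear head over the vocabulary, gives the full model.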
The code follows a tutorial, but to show it isn't just copied blindly I added my own explanations and understanding in a `.ipynb` notebook, which you can check out if you want!

(You can tweak these at the top of the script.)

- `batch_size = 16` → number of sequences per batch
- `block_size = 32` → maximum context length
- `n_embed = 64` → embedding dimension
- `n_head = 4` → number of attention heads
- `n_layer = 4` → number of transformer blocks
- `dropout = 0.0` → dropout rate
- `learning_rate = 1e-3` → optimizer learning rate
- `max_iters = 5000` → training iterations
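Put together, those defaults would look like this at the top of `bigram.py` (a sketch based on the list above; the script itself is authoritative):

```python
# Hyperparameters (defaults from the README; tweak these at the top of bigram.py)
batch_size = 16       # number of sequences processed in parallel
block_size = 32       # maximum context length in tokens
n_embed = 64          # embedding dimension
n_head = 4            # attention heads; each head sees n_embed // n_head = 16 dims
n_layer = 4           # number of stacked transformer blocks
dropout = 0.0         # dropout rate (0.0 = disabled)
learning_rate = 1e-3  # optimizer learning rate
max_iters = 5000      # total training iterations
```

Note that `n_embed` must be divisible by `n_head`, since the embedding is split evenly across attention heads.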
Clone the project:

```shell
git clone https://github.com/Aruniaaa/GPT-from-scratch
```

Go to the project directory:

```shell
cd GPT-from-scratch
```

Install dependencies:

```shell
pip3 install torch torchvision --index-url https://download.pytorch.org/whl/cu126
```

Run the code:

```shell
python bigram.py
```

NOTE: The `pip install` command for PyTorch may differ on your machine. Check the official PyTorch website for the command matching your OS and CUDA version.
The Google Colab notebook (GPT-from-scratch.ipynb) contains:
A breakdown of the code
A short explanation of how transformers work
I’m still learning, so my explanations might not be perfect, but feedback and corrections are always welcome!
```
├── Del-data.txt           # dataset
├── GPT-from-scratch.ipynb # code + explanation
├── README.md
└── bigram.py              # main training and model script
```