This blog post is pretty cool
https://www.towardsdeeplearning.com/andrej-karpathy-just-built-an-entire-gpt-in-243-lines-of-python-7d66cfdfa301
It builds a GPT in 243 lines of Python, including the autograd engine.
It learns baby names and then generates new ones.
Q: How is this different from a simple transition matrix that builds probabilities from word transitions? Is it the token size, or how it builds patterns?
I think this could form the foundation of an interesting lecture using Python.
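To make the question concrete, here is a minimal sketch of the transition-matrix baseline it mentions (my own illustration, not from the blog post): a character-level bigram model that counts which character follows which, then samples names from those counts. The training names and the `sample_name` helper are hypothetical. The contrast with a GPT is that this table conditions only on the single previous character, while attention lets the model condition each prediction on the whole preceding context.

```python
import random
from collections import defaultdict

# Toy training set (assumption: any list of lowercase names would do).
names = ["emma", "olivia", "ava", "isabella", "sophia", "mia"]

# counts[a][b] = how often character b follows character a.
# '^' marks the start of a name, '$' marks the end.
counts = defaultdict(lambda: defaultdict(int))
for name in names:
    chars = ["^"] + list(name) + ["$"]
    for a, b in zip(chars, chars[1:]):
        counts[a][b] += 1

def sample_name(rng=random):
    """Walk the transition table from '^' until '$', collecting characters."""
    ch, out = "^", []
    while True:
        next_chars, weights = zip(*counts[ch].items())
        ch = rng.choices(next_chars, weights=weights)[0]
        if ch == "$":
            return "".join(out)
        out.append(ch)

print(sample_name(random.Random(0)))
```

Every character the sampler emits was seen in training, but it has no memory beyond the last character, so it happily produces transitions that never co-occur in any single name.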