Good article on how to do it from scratch "https://www.kaggle.com/code/soham1024/basic-neural-network-from-scratch-in-python"
This video and this github is what I followed:
- https://github.com/lukepolson/youtube_channel/blob/main/Python%20Tutorial%20Series/pytorch4.ipynb
- https://www.youtube.com/watch?v=gBw0u_5u0qU&t=607s
- Where are these random numbers coming from and do they matter? (
batch_size= 5,epoch= 20,lr=0.01) etc - Understading the
MyNeuralNetclass more (why 3 matrix's?, understanding the foward pass more?) - dont know what
squeezeorviewdoes - understanding why everything is a tensor
RELU- The cross entropy loss function
- Stochastic gradient descecnt
- why zero out the gradient
lr=0.01stands for learning rate and is a hyper paremeter in Stochastic gradient descecnt