References

Foundational Papers

Perceptrons (1960) — Frank Rosenblatt
- Abstract: Original work on perceptrons and neural networks
- Relevance: Historical foundation
Backpropagation (1986) — Rumelhart / Hinton / Williams
- Abstract: Development of the backpropagation algorithm
- Relevance: Critical for neural network training

ImageNet Classification (2012) — Alex Krizhevsky et al.
- Abstract: Breakthrough in deep learning with AlexNet
- Relevance: Modern neural network success
Deep Residual Learning (2015) — He et al.
- Abstract: Residual networks for deeper models
- Relevance: Advanced training techniques; baseline architectural prior behind many models discussed under [[MOC - Academic Paper Generation]].

Neural Networks and Learning Machines (2013) — Haykin
- Abstract: Comprehensive coverage of neural network theory
- Relevance: Core theory
Deep Learning (2016) — Goodfellow et al.
- Abstract: Detailed guide to deep learning
- Relevance: Advanced concepts; background for [[RLHF and RLAIF]] and [[Direct Preference Optimization]].
Build a Large Language Model (From Scratch) (2024) — Sebastian Raschka
- Abstract: End-to-end implementation of a GPT-style LLM from tokenizer through pre-training and fine-tuning.
- Relevance: Hands-on grounding for the training-strategy notes — especially [[Structural Token Insertion]], [[Negative Sampling]], and the fine-tuning loop targeted by [[Direct Preference Optimization]].
AI and Machine Learning for Coders (2020) — Laurence Moroney
- Abstract: Practical introduction to ML/DL with TensorFlow across vision, NLP, and on-device deployment.
- Relevance: Implementation-side companion to the more theory-heavy entries above; useful baseline reading before tackling the loss-function changes discussed in [[RLHF and RLAIF]] and [[Direct Preference Optimization]].

MITx 6.008.1x — Introduction to computational thinking
- Relevance: Foundational computational skills
YouTube: 3Blue1Brown — Visualizations of linear algebra and neural networks
- Relevance: Intuitive understanding of math concepts
YouTube: sentdex — Practical Python for ML
- Relevance: Hands-on implementation

Linear Algebra and Its Applications (2020) — Gilbert Strang
- Abstract: Core linear algebra for ML
- Relevance: Matrix operations, geometry; underpins [[Embedding Cross-Sectional Similarity]].
Matrix Analysis (1990) — Roger Horn, Charles Johnson
- Abstract: Advanced matrix theory
- Relevance: Deep mathematical foundations