Skip to content

harrystaley/open-source-data-science-degree-python

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

15 Commits
 
 
 
 

Repository files navigation

The Python Open Source Data Science Degree

A curated list of free or low-cost courses from reputable universities and organizations that satisfy the same requirements as an undergraduate Computer Science / Data Science degree, minus general education. Updated for 2026.


Learning How to Efficiently Learn

If it has been a while since you were in the classroom, this is mandatory. These are high-leverage meta-skills that pay dividends across every other course.

Courses School Duration Effort Frequency Prerequisites Provider
Learning How to Learn Deep Teaching Solutions 4 weeks 3-4 hrs/week self-paced none Coursera
A Mind for Numbers (Book) Barbara Oakley none Book

Computer Science Basics

Courses School Duration Effort Frequency Prerequisites Provider
CS50x: Introduction to Computer Science Harvard 12 weeks 10-20 hrs/week self-paced none edX
CS50P: Introduction to Programming with Python Harvard 10 weeks 10-20 hrs/week self-paced none edX
Mathematical Thinking in Computer Science UC San Diego 6 weeks 2-5 hrs/week once a month none Coursera

Programming

Courses School Duration Effort Frequency Prerequisites Provider
Python for Everybody Specialization U of Michigan 8 months 3-6 hrs/week self-paced none Coursera
Python OOP & Design Patterns Duke 4 weeks 4-6 hrs/week self-paced Python basics Coursera
Software Engineering: Introduction UBC 6 weeks 6-8 hrs/week self-paced Programming basics edX

Math

Single Variable Calculus

Courses School Duration Effort Frequency Prerequisites Provider
18.01x Single Variable Calculus MIT 10 months 6-10 hrs/week self-paced pre-calculus edX

Linear Algebra and Statistics

Courses School Duration Effort Frequency Prerequisites Provider
Linear Algebra - Foundations to Frontiers UT Austin 15 weeks 6-10 hrs/week self-paced pre-calculus edX
Introduction to Statistics Stanford 7 weeks 5 hrs/week self-paced none Coursera
Statistical Learning Stanford 9 weeks 5 hrs/week self-paced Linear Algebra, Stats edX

Databases

Courses School Duration Effort Frequency Prerequisites Provider
PostgreSQL for Everybody Specialization U of Michigan 4 months 4-6 hrs/week self-paced Programming basics Coursera
Databases: Relational Databases and SQL Stanford 6 weeks 5-10 hrs/week self-paced none edX
Vector Databases: from Embeddings to Applications Weaviate / DeepLearning.AI 1 week 2-3 hrs self-paced Python basics DeepLearning.AI

Data Science

Courses School Duration Effort Frequency Prerequisites Provider
Applied Data Science with Python Specialization U of Michigan 5 months 7 hrs/week self-paced Python Coursera
Data Science: Visualization Harvard 8 weeks 2-4 hrs/week self-paced R basics edX
Practical Data Ethics fast.ai 4 weeks 4 hrs/week self-paced none fast.ai

Machine Learning and AI

This section is now a core requirement, not optional.

Foundations

Courses School Duration Effort Frequency Prerequisites Provider
Machine Learning Specialization Stanford / DeepLearning.AI 3 months 9 hrs/week self-paced Python, Linear Algebra Coursera
Practical Deep Learning for Coders fast.ai 10 weeks 8-10 hrs/week self-paced Python, some math fast.ai
Deep Learning Specialization DeepLearning.AI 5 months 4-5 hrs/week self-paced ML basics Coursera

Large Language Models and Generative AI

Courses School Duration Effort Frequency Prerequisites Provider
Short Courses DeepLearning.AI 1-3 hrs each 1-3 hrs self-paced Python DeepLearning.AI
Hugging Face NLP Course Hugging Face 6 weeks 4-6 hrs/week self-paced Python, ML basics Hugging Face
CS324: Large Language Models Stanford 10 weeks 6-8 hrs/week self-paced Deep Learning Stanford (free)
LLM Bootcamp Full Stack Deep Learning 8 hours self-paced ML basics FSDL

MLOps

Courses School Duration Effort Frequency Prerequisites Provider
MLOps Specialization DeepLearning.AI 4 months 4 hrs/week self-paced ML basics Coursera
Weights & Biases Courses Weights & Biases varies self-paced Python, ML W&B

Computing Systems

Courses School Duration Effort Frequency Prerequisites Provider
Build a Modern Computer from First Principles I Hebrew University 6 weeks 10-15 hrs/week self-paced none Coursera
Build a Modern Computer from First Principles II Hebrew University 6 weeks 10-15 hrs/week self-paced Part I Coursera
Introduction to Operating Systems Georgia Tech 8 weeks 5-8 hrs/week self-paced Part II Udacity

Computer Science Theory

Courses School Duration Effort Frequency Prerequisites Provider
Algorithms, Part I Princeton 6 weeks 6-12 hrs/week once a month Programming basics Coursera
Algorithms, Part II Princeton 6 weeks 6-12 hrs/week once a month Part I Coursera
Automata Theory Stanford 6 weeks 4-8 hrs/week self-paced Theory basics Coursera

Cloud Computing

Cloud is now a foundational skill for data scientists and engineers.

Courses School Duration Effort Frequency Prerequisites Provider
Cloud Computing Foundations Duke 5 weeks 4-6 hrs/week self-paced Linux basics Coursera
AWS Cloud Practitioner Essentials AWS 6 hours self-paced none Coursera
Google Cloud Fundamentals: Core Infrastructure Google 1 week 8-10 hrs self-paced none Coursera
Recommended Certification Path

Pursue one cloud provider certification (AWS, GCP, or Azure) at the associate level after completing the foundational course.


DevOps and Containers

Courses School Duration Effort Frequency Prerequisites Provider
Docker for Beginners Docker 4-8 hrs self-paced Linux basics Docker
Kubernetes for Developers Linux Foundation 14 weeks 2-3 hrs/week self-paced Docker basics edX
DevOps, DataOps, MLOps Duke 7 weeks 4-6 hrs/week self-paced Programming basics Coursera

Computer Applications

Courses School Duration Effort Frequency Prerequisites Provider
Cryptography I Stanford 7 weeks 5 hrs/week once a month Linear Algebra Coursera
Computer Networking: A Top-Down Approach UMass 8 weeks 5 hrs/week self-paced CS basics Free (textbook)

Unix and Linux

Courses School Duration Effort Frequency Prerequisites Provider
The Unix Workbench JHU 4 weeks 4 hrs/week once a month none Coursera
Linux Command Line Basics Udacity 1 week 5 hrs/week self-paced none Udacity

Open Source Development and Version Control

Courses School Duration Effort Frequency Prerequisites Provider
Version Control with Git Google 1 week 5 hrs self-paced none Coursera
Open Source Software Development, Linux and Git Specialization Linux Foundation 4 months 3-5 hrs/week self-paced some programming Coursera
GitHub Actions GitHub 1-2 hrs self-paced Git basics GitHub Skills

Recommended Learning Order

  1. Learning How to Learn
  2. CS50x → CS50P
  3. Python for Everybody
  4. Math (Calculus → Linear Algebra → Statistics)
  5. Algorithms Part I & II
  6. Databases
  7. Computing Systems
  8. Machine Learning Specialization
  9. Practical Deep Learning for Coders
  10. LLM courses (DeepLearning.AI short courses)
  11. Cloud Computing + DevOps
  12. MLOps

What Changed from the Previous Version

  • Added: Machine Learning and AI section (now a core requirement)
  • Added: LLM / Generative AI subsection with Hugging Face and DeepLearning.AI courses
  • Added: MLOps subsection
  • Added: Cloud Computing section (AWS, GCP)
  • Added: DevOps and Containers section (Docker, Kubernetes)
  • Added: Vector Databases to the databases section
  • Replaced: Individual edX CS50 links → direct Harvard links (more stable)
  • Replaced: Database individual courses → consolidated specialization + modern additions
  • Removed: IBM Data Science Certificate (superseded by more targeted offerings)
  • Removed: Applied Cryptography (edX) — Cryptography I (Stanford) covers the same ground better

About

A fully curated, open-source Data Science curriculum focused on Python. Includes top-tier university courses (MIT, Stanford, Princeton) covering essential topics in computer science, data analysis, machine learning, and statistics — everything you need to build a solid foundation in Data Science, 100% free.

Topics

Resources

Stars

Watchers

Forks

Packages

 
 
 

Contributors