A hands-on PySpark training.
- Sign Up: Create a free account at Databricks Community Edition.
- Clone Repo:
- Go to Workspace (left sidebar) -> Git Folders (or Repos).
- Click Add -> Repo and paste this repository's URL.
Requires Docker installed.
- Run the following command in your terminal inside this repository's folder:
(Windows users: replace
docker run -p 8888:8888 -v "$PWD":/home/jovyan/work jupyter/pyspark-notebook"$PWD"with"%cd%") - Click the
http://127.0.0.1:8888...link printed in your terminal. - Open the notebook inside the
workfolder.