Spatiotemporal SVGP Experiments

This repository contains code and supporting assets for modelling urban heat with sparse variational Gaussian processes (SVGPs). The workflow ingests hourly weather observations, engineers spatiotemporal features, and trains GP models that can be validated or used to generate predictive maps.

Highlights

SVGP models for temperature and dewpoint with custom covariance structures.
Data pipeline that filters, normalises, and weights station observations.
Training and evaluation loops with weighted likelihood support and GPU acceleration.
Notebooks and scripts for exploratory analysis, preprocessing, and figure generation.

Repository Layout

main.py – command-line entry point used for SLURM array jobs.
Datasets/ – dataset loader, QC routines, and LGCP-based weighting helpers.
src/ – python package exposing models, trainers, and lightweight utilities.
scripts/ – auxiliary scripts for preprocessing and experiment management.
notebooks/ – exploratory analysis, diagnostics, and figure notebooks.
data/, experiments/, results/, figures/ – local working directories for inputs and outputs (not tracked in git).

Getting Started

Install dependencies
```
python -m venv env
source env/bin/activate
pip install -r requirements.txt
```
PyTorch and GPyTorch are required; install the CUDA build that matches your system.
Prepare data
Place Weather Underground station exports under data/ (e.g. station=XXXX/…) and reference ERA5 tables under the directory you pass via --ref_data. The dataset loader expects monthly CSVs named YYYY-MM.csv.

Train a model

python main.py \
  --input data/weather_underground \
  --output results/temp_pll \
  --variable tempAvg \
  --likelihood Student \
  --loss PLL \
  --train_size 0.8 \
  --num_epochs 20 \
  --batch_size 512 \
  --num_inducing_points 1000 \
  --ref_data data/era5_reference

When SLURM_ARRAY_TASK_ID is set, the script derives (year, month) automatically. Without SLURM you can set the variable manually (e.g. SLURM_ARRAY_TASK_ID=24 python main.py …).

Examine outputs
- model_YYYY_MM.pt and likelihood_YYYY_MM.pt store learned parameters.
- results_YYYY_MM.json (for held-out validation) reports MAE/MSE/NLPD and quantile coverage metrics.
- {YYYY}-{MM}.csv (when training on full data) stores map-ready predictions and uncertainty bounds.

Development Tips

Core code lives under src/; import everything via from src import ….
Datasets/ contains code which is specific to how the data is saved on my machine.
Custom loggers and GP helpers are defined in src/utils.py.

License

Distributed under the MIT License. You may use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the software, provided the copyright notice and license text are included. The software is supplied “as is” without warranty, and the authors are not liable for damages arising from its use.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Spatiotemporal SVGP Experiments

Highlights

Repository Layout

Getting Started

Development Tips

License

About

Uh oh!

Releases 1

Packages

Uh oh!

Contributors

Uh oh!

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Datasets		Datasets
notebooks		notebooks
scripts		scripts
src		src
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Folders and files

Latest commit

History

Repository files navigation

Spatiotemporal SVGP Experiments

Highlights

Repository Layout

Getting Started

Development Tips

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 1

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages