This repository contains the code and resources used for the experiments on biological image compression and analysis in the paper:
Accepted at the 3rd Workshop on Imageomics: Discovering Biological Knowledge from Images Using AI, held at the 39th Conference on Neural Information Processing Systems (NeurIPS 2025).
https://arxiv.org/abs/2510.19887
cpg0000-jump-pilot/
Contains metadata and scripts required to run experiments with the CPJUMP1 dataset. Includes scripts for downloading and preprocessing the images.
lsun/
Includes partition_names.txt for parallelizing inference on the LSUN dataset.
dataset/
Contains files for defining Dataset objects used in inference.
models/
Holds definitions of all pre-trained models and their associated inference functions.
inference_cpg0000.py
Inference of pre-trained models on the CPJUMP1 dataset.
inference_lsun.py
Inference of pre-trained models on the LSUN dataset.
notebook.sh
Sets up the environment for running Jupyter notebooks on a cluster.
phenotypic_activity.ipynb
Computes phenotypic activity, reported as fraction retrieved (FR), on the CPJUMP1 dataset.
plots.ipynb
Generates plots for evaluation metrics including MAE, SSIM, EMD, KLD, and FID.
requirements.txt
Lists all libraries installed in the virtual environment, as generated by pip3 freeze.
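As an illustration of how lsun/partition_names.txt could be used to parallelize inference, the sketch below splits a list of partition names across workers in round-robin order. It is not taken from inference_lsun.py: the one-name-per-line file layout, the function names, and the worker_index / num_workers parameters are all assumptions made for this example.

```python
from pathlib import Path
from typing import List


def read_partition_names(path: str) -> List[str]:
    """Read partition names, assuming one name per line (hypothetical layout)."""
    lines = Path(path).read_text(encoding="utf-8").splitlines()
    # Skip blank lines and strip surrounding whitespace.
    return [line.strip() for line in lines if line.strip()]


def partitions_for_worker(names: List[str], worker_index: int, num_workers: int) -> List[str]:
    """Round-robin split: worker i takes names i, i + n, i + 2n, ..."""
    if num_workers <= 0:
        raise ValueError("num_workers must be positive")
    if not 0 <= worker_index < num_workers:
        raise ValueError("worker_index must be in [0, num_workers)")
    return names[worker_index::num_workers]


if __name__ == "__main__":
    # Hypothetical partition names, used only for illustration.
    example_names = ["part_a", "part_b", "part_c", "part_d", "part_e"]
    for worker in range(2):
        print(worker, partitions_for_worker(example_names, worker, 2))
```

On a cluster, worker_index would typically come from the job array index, so each job processes a disjoint subset of partitions and every partition is covered exactly once.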
- Download CPJUMP1 images
Run cpg0000-jump-pilot/download.sh to download the dataset.
- Preprocess images
Use processing.sh to preprocess the downloaded images.
- Download LSUN dataset
Follow instructions or use provided scripts to obtain the LSUN dataset.
- Run inference
Execute inference_cpg0000.sh and inference_lsun.sh to perform inference on the respective datasets.
- Analyze results
  - Use phenotypic_activity.ipynb to compute phenotypic activity.
  - Use plots.ipynb to visualize evaluation metrics.
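For readers unfamiliar with the metrics plotted by plots.ipynb, the sketch below shows the standard definitions of two of them, MAE and KLD, using only the Python standard library. It is a minimal illustration and not the notebook's implementation: the function names, the flat-list inputs, and the eps smoothing term are assumptions made for this example.

```python
import math
from typing import Sequence


def mean_absolute_error(x: Sequence[float], y: Sequence[float]) -> float:
    """MAE between two equally sized sequences of pixel values."""
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    return sum(abs(a - b) for a, b in zip(x, y)) / len(x)


def kl_divergence(p: Sequence[float], q: Sequence[float], eps: float = 1e-12) -> float:
    """KL(p || q) in nats for two histograms defined over the same bins.

    Both histograms are normalized to sum to 1. The eps floor on q is an
    assumption of this sketch; it avoids division by zero for empty bins.
    """
    if len(p) != len(q) or len(p) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    p_sum, q_sum = sum(p), sum(q)
    if p_sum <= 0 or q_sum <= 0:
        raise ValueError("histograms must have positive total mass")
    total = 0.0
    for a, b in zip(p, q):
        pa = a / p_sum
        qb = max(b / q_sum, eps)
        if pa > 0:  # bins with zero mass in p contribute nothing
            total += pa * math.log(pa / qb)
    return total


if __name__ == "__main__":
    # Toy values, used only for illustration.
    print(mean_absolute_error([0.0, 1.0, 2.0], [0.0, 2.0, 4.0]))
    print(kl_divergence([1, 1], [1, 3]))
```

MAE compares images pixel by pixel, whereas KLD compares intensity distributions, so a reconstruction can score well on one and poorly on the other.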