
Visualizing Monosemantic Features in Diffusion Models

This is the code repository for my engineering thesis, Visualizing Monosemantic Features in Diffusion Models.

Requirements

  • Conda
  • Python 3.12
  • Make

It is advised to run these experiments on a computing cluster, as they are computationally demanding. Some stages can run on a local machine, but only if the results of the computationally heavy stages (e.g. the capture stage) are precomputed and available.

Initial Setup

Local

  1. make env
  2. conda activate diffusion-deep-dream-research-env
  3. make install
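
One quick way to sanity-check the environment without launching anything is Hydra's --cfg flag, which prints the composed configuration and exits:

python main.py stage=provision infrastructure=local --cfg job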

PLGrid

The PLGrid setup is involved, so I provide a few scripts to streamline the process. Note that these instructions are made specifically for PLGrid and are not universal across SLURM clusters.

  1. chmod +x ./scripts/start-interactive-session.sh
  2. chmod +x ./scripts/setup-plg.sh
  3. ./scripts/start-interactive-session.sh
  4. source ./scripts/setup-plg.sh
  5. make env
  6. conda activate diffusion-deep-dream-research-env
  7. make install
  8. exit
  9. source ./scripts/setup-plg.sh
  10. conda activate diffusion-deep-dream-research-env

Running these commands sets up a Conda environment in the $SCRATCH directory, which is periodically purged, so the environment may be removed after a few days of inactivity. If that happens, just rerun the commands above.

Environment variables

Copy .env.example to .env and fill in your values:

cp .env.example .env

The .env file configures user-specific settings used by the athena infrastructure config:

| Variable | Description |
|---|---|
| PLG_GROUP_NAME | Your PLGrid group name |
| PLG_USERNAME | Your PLGrid username |
| PLG_SLURM_PARTITION | Your SLURM GPU partition |
| PLG_SLURM_ACCOUNT | Your SLURM account for the GPU partition |
| NOTIFICATION_EMAIL | Email address for SLURM job notifications |
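
A filled-in .env might look like this (all values below are placeholders):

PLG_GROUP_NAME=plggexample
PLG_USERNAME=plgjsmith
PLG_SLURM_PARTITION=plgrid-gpu-a100
PLG_SLURM_ACCOUNT=plggexample-gpu-a100
NOTIFICATION_EMAIL=jsmith@example.com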

When you come back

Local

  1. conda activate diffusion-deep-dream-research-env

PLGrid

  1. source ./scripts/setup-plg.sh
  2. conda activate diffusion-deep-dream-research-env

Repository structure

├── conf/                                    # Hydra configuration
│   ├── config.yaml                          # Main config (composes all defaults)
│   ├── models/                              # Model asset definitions
│   ├── datasets/                            # Dataset asset definitions
│   ├── stages/                              # Per-stage hyperparameters
│   ├── infrastructure/                      # local vs athena (PLGrid) settings
│   └── transformation_robustness/           # Augmentation level presets
├── diffusion_deep_dream_research/           # Main source package
│   ├── config/config_schema.py              # Pydantic/dataclass config schema
│   ├── core/
│   │   ├── hooks/                           # CaptureHook, SteeringHook (PyTorch forward hooks)
│   │   ├── model/                           # HookedModelWrapper, ModifiedDiffusionPipelineAdapter
│   │   ├── data/                            # UniquePromptDataset, IndexDataset
│   │   └── regularisation/                  # Penalties, gradient transforms, latent augmentation
│   ├── stages/                              # Pipeline stage implementations (s00–s06)
│   └── utils/                               # Logging, result reading, torch helpers
├── scripts/                                 # PLGrid setup & inspector app
│   ├── inspector.py                         # Streamlit result inspector
│   ├── setup-plg.sh                         # PLGrid environment setup
│   └── start-interactive-session.sh         # PLGrid interactive session
├── submodules/SAeUron/                      # Sparse Autoencoder (git submodule)
├── notebooks/                               # Example notebooks
├── exploration/                             # Exploratory scripts
├── tests/                                   # Unit tests (pytest)
├── main.py                                  # Entry point
├── Makefile                                 # Build/install/test commands
└── pyproject.toml                           # Project metadata, dependencies, ruff config

Experiment config

Configuration uses Hydra with type-safe schemas defined as Python dataclasses in diffusion_deep_dream_research/config/config_schema.py.

The main config file conf/config.yaml composes defaults from several groups:

  • models/ — asset definitions for Stable Diffusion, SAeUron, Style50 (name, source, download URL)
  • datasets/ — dataset assets (e.g. unlearn_canvas)
  • stages/ — hyperparameters for each pipeline stage
  • infrastructure/ — local (CPU, submitit_local) or athena (A100 GPU, submitit_slurm)

Key top-level settings in config.yaml:

| Parameter | Description |
|---|---|
| stage | Required. Which pipeline stage to run (see below) |
| experiment_name | Name for the experiment run (affects the output directory) |
| model_to_analyse | Which model to use (default: style50) |
| target_layer_name | UNet layer to hook into (default: up_blocks.1.attentions.1) |
| use_sae | Whether to use the Sparse Autoencoder |
| infrastructure | local or athena (default: athena) |
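
For example, to run capture locally on the default layer with the SAE enabled (an illustrative combination of the settings above):

python main.py stage=capture target_layer_name=up_blocks.1.attentions.1 use_sae=true infrastructure=local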

The data_root setting controls where all data lives. Two directories are derived from it:

  • assets_dir = <data_root>/assets — downloaded models and datasets
  • outputs_dir = <data_root>/outputs — experiment results

Each infrastructure config sets data_root differently:

  • local — data_root is the project root directory
  • athena — data_root is $PLG_GROUPS_STORAGE/$PLG_GROUP_NAME/$PLG_USERNAME/<project_name> ($PLG_GROUPS_STORAGE is preconfigured on PLGrid)

You can override it directly:

python main.py stage=capture data_root=/path/to/my/data

Override any parameter from the command line using Hydra syntax:

python main.py stage=capture experiment_name=my_experiment infrastructure=local

Later stages reference outputs from earlier stages via *_results_dir parameters in their config (e.g. capture_results_dir, timestep_analysis_results_dir, prior_results_dir). These are relative paths under the outputs directory.
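
For example, to point timestep analysis at a specific capture run (the run directory below is hypothetical; see the output directory layout later in this document):

python main.py stage=timestep_analysis \
  stages.timestep_analysis.capture_results_dir=my_experiment/capture/2025-01-15/10-30-00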

Infrastructure and GPU configuration

Each infrastructure config sets the Lightning Fabric accelerator and the Hydra launcher:

  • local — fabric.accelerator=cpu, submitit_local launcher
  • athena — fabric.accelerator=cuda, submitit_slurm launcher

Switch between them with the infrastructure override:

python main.py --multirun stage=capture infrastructure=local   # CPU, submitit_local
python main.py --multirun stage=capture infrastructure=athena  # A100 GPU, submitit_slurm

You can also override the accelerator directly:

python main.py stage=capture infrastructure=local fabric.accelerator=cuda

To run on a different cluster or machine, create a new config file in conf/infrastructure/ (e.g. my_cluster.yaml) following the structure of local.yaml or athena.yaml, then use infrastructure=my_cluster.
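
A minimal way to start is to copy one of the existing configs and adapt it (my_cluster is a placeholder name):

cp conf/infrastructure/local.yaml conf/infrastructure/my_cluster.yaml
# edit conf/infrastructure/my_cluster.yaml (accelerator, launcher, data_root, SLURM account/partition)
python main.py --multirun stage=capture infrastructure=my_cluster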

The athena config (conf/infrastructure/athena.yaml) sets SLURM launcher parameters. To change GPU and node allocation, override the Hydra launcher settings:

python main.py --multirun stage=capture \
  infrastructure=athena \
  hydra.launcher.gpus_per_node=4 \
  hydra.launcher.tasks_per_node=4 \
  hydra.launcher.timeout_min=120

Key SLURM parameters in athena.yaml (set under the hydra.launcher package):

| Parameter | Default | Description |
|---|---|---|
| gpus_per_node | 0 | Number of GPUs per node. Keep equal to tasks_per_node (except 0 for CPU-only jobs) |
| tasks_per_node | 1 | Number of tasks per node. Keep equal to gpus_per_node |
| cpus_per_task | 16 | CPU cores per task |
| mem_gb | 125 | Memory per node (GB) |
| timeout_min | 30 | Job time limit in minutes |
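
For example, a CPU-only stage such as timestep analysis can be given more cores and memory while keeping gpus_per_node at 0 (the values below are illustrative):

python main.py --multirun stage=timestep_analysis \
  hydra.launcher.cpus_per_task=32 \
  hydra.launcher.mem_gb=250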

The athena config also sends email notifications on job BEGIN, END, and FAIL. The recipient is read from the NOTIFICATION_EMAIL variable in your .env file.

Running experiments

All experiments are run through main.py with the stage parameter specifying which pipeline stage to execute:

python main.py --multirun stage=<stage_name>

The seven stages, meant to be run in order:

| Stage | Command | Description |
|---|---|---|
| Provision | python main.py stage=provision | Downloads models and datasets from HuggingFace/GDrive to <data_root>/assets/ |
| Capture | python main.py --multirun stage=capture hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=240 | Runs inference on dataset prompts and captures neuron activations at each timestep. Supports distributed execution via Lightning Fabric. I recommend at least 4 GPUs |
| Timestep Analysis | python main.py --multirun stage=timestep_analysis | Analyzes captured activations and computes active timesteps, activity peaks, and dataset examples. No GPU needed |
| Plots | python main.py --multirun stage=plots | Generates activity profile visualizations. No GPU needed |
| Prior | python main.py --multirun stage=prior hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=600 | Generates steered priors. I recommend at least 4 GPUs |
| Deep Dream | python main.py --multirun stage=deep_dream hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=600 | Main optimization. I recommend at least 4 GPUs |
| Representation | python main.py --multirun stage=representation | Compiles final results into per-channel data shards and an index for inspection. No GPU needed |
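
Putting it together, a typical end-to-end run on PLGrid executes the commands from the table in order (GPU counts and time limits are the suggested starting points; provision runs without --multirun so it has internet access, as explained below):

python main.py stage=provision
python main.py --multirun stage=capture hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=240
python main.py --multirun stage=timestep_analysis
python main.py --multirun stage=plots
python main.py --multirun stage=prior hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=600
python main.py --multirun stage=deep_dream hydra.launcher.gpus_per_node=4 hydra.launcher.tasks_per_node=4 hydra.launcher.timeout_min=600
python main.py --multirun stage=representation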

Multi-run (SLURM)

For cluster execution, use the --multirun flag together with infrastructure=athena (the default, targeting PLGrid):

python main.py --multirun stage=capture

This command should be executed on a login node, with the Conda environment set up as described in the Initial Setup section.

Keep in mind that multi-run jobs on a GPU partition do not have internet access, so the provision stage must be run either on the login node (simply without the --multirun flag) or in an interactive session (./scripts/start-interactive-session.sh).

Overriding stage parameters

Stage-specific parameters can be overridden directly:

python main.py stage=deep_dream stages.deep_dream.num_steps=200 stages.deep_dream.learning_rate=0.1

Stage parameters reference

All stage parameters are defined as Pydantic dataclasses in diffusion_deep_dream_research/config/config_schema.py, with defaults set in conf/stages/<stage>.yaml. Some capture parameters (batch size, workers) are set per infrastructure in conf/infrastructure/. You can modify them in the .yaml files or override them on the command line.

Capture (stages.capture.*)

| Parameter | Default (local / athena) | Description |
|---|---|---|
| num_images_per_prompt | 1 / 5 | Images generated per prompt |
| batch_size | 1 / 1 | Batch size for inference |
| num_workers | 1 / 1 | DataLoader workers |
| log_every_n_steps | 1 / 10 | Logging frequency |
| dev_n_prompts | None | Limit the number of prompts (for debugging) |
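
For a quick debugging run, limit the prompt count (the values below are illustrative):

python main.py stage=capture infrastructure=local \
  stages.capture.dev_n_prompts=10 \
  stages.capture.num_images_per_prompt=1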

Timestep Analysis (stages.timestep_analysis.*)

| Parameter | Default | Description |
|---|---|---|
| top_k | 20 | Number of top dataset examples to keep per channel |
| total_timesteps | 1000 | Total diffusion timesteps |
| peak_threshold | 0 | Minimum value for a peak |
| peak_separation | 100 | Minimum distance between peaks |
| top_peak_count | 3 | Number of top peaks to select |
| capture_results_dir | (set in yaml) | Path to capture stage output (relative to outputs_dir) |

Plots (stages.plots.*)

| Parameter | Default | Description |
|---|---|---|
| frame_duration | 2 | Duration per frame in generated GIFs |
| timestep_analysis_results_dir | (set in yaml) | Path to timestep analysis output |

Prior (stages.prior.*)

| Parameter | Default | Description |
|---|---|---|
| timesteps | active_timesteps | Which timesteps to steer at (active_timesteps, all_timesteps, activity_peaks) |
| n_results | 5 | Number of results per channel |
| seeds | [1024, 1111, 4359, 3248, 1026] | Random seeds for generation |
| steer_strength_scale | 50 | Steering strength multiplier |
| steer_strength_scale_sae | 500 | Steering strength for SAE features |
| start_channel / end_channel | None | Optional channel range (for partial runs) |
| timestep_analysis_results_dir | (set in yaml) | Path to timestep analysis output |
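
For example, a partial run covering only the first 64 channels (channel bounds are illustrative):

python main.py --multirun stage=prior \
  stages.prior.start_channel=0 \
  stages.prior.end_channel=64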

Deep Dream (stages.deep_dream.*)

| Parameter | Default | Description |
|---|---|---|
| timesteps | [0, "activity_peaks"] | Timesteps to optimize at (ints or "activity_peaks") |
| use_just_one_timestep | false | If true, optimize one timestep at a time |
| see_through_schedule_noise | true | See through scheduler noise during optimization |
| **Regularization** | | |
| use_prior | true | Initialize from prior stage output |
| total_variation_penalty_weight | 0.5 | Spatial smoothness penalty |
| range_penalty_weight | 0.5 | Value range penalty |
| range_penalty_threshold | 3.0 | Threshold for range penalty |
| moment_penalty_weight | 0.0 | Moment penalty |
| gradient_smoothing_sigma_start | 0.5 | Gaussian blur sigma (start of schedule) |
| gradient_smoothing_sigma_end | 0.0 | Gaussian blur sigma (end of schedule) |
| gradient_smoothing_kernel_size | 9 | Kernel size for gradient smoothing |
| use_gradient_spectral_filtering | true | Frequency-domain gradient preconditioning |
| **Augmentation** | | |
| jitter_max | 1 | Max jitter pixels |
| rotate_max | 5 | Max rotation degrees |
| scale_max | 1.1 | Max scale factor |
| **Optimization** | | |
| num_steps | 100 | Optimization steps |
| learning_rate | 0.05 | Learning rate |
| n_results | 5 | Results per channel (when not using prior) |
| seeds | [1024, 1111, 4359, 3248, 1026] | Seeds (when not using prior) |
| start_channel / end_channel / channels | None | Optional channel subset |
| prior_results_dir | (set in yaml) | Path to prior stage output |
| timestep_analysis_results_dir | (set in yaml) | Path to timestep analysis output |

Most parameters also have *_sae variants (e.g. total_variation_penalty_weight_sae) for running experiments with SAE features using different hyperparameters.
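
For example, an SAE run with a stronger total-variation penalty might look like this (the weight value is illustrative):

python main.py --multirun stage=deep_dream \
  use_sae=true \
  stages.deep_dream.total_variation_penalty_weight_sae=1.0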

Representation (stages.representation.*)

| Parameter | Default | Description |
|---|---|---|
| timestep_analysis_results_dir | (set in yaml) | Path to timestep analysis output |
| prior_results_dir | (set in yaml) | Path to prior output |
| deep_dream_results_dir_noise | (set in yaml) | Path to deep dream output (with noise) |
| deep_dream_results_dir_no_noise | None | Optional path to deep dream output (without noise) |

Sweeps

Use Hydra's --multirun flag with comma-separated values or glob syntax to sweep over parameters:

python main.py --multirun stage=deep_dream \
  stages.deep_dream.learning_rate=0.01,0.05,0.1 \
  stages.deep_dream.total_variation_penalty_weight=0.1,0.5,1.0

Each combination launches a separate job. On SLURM (infrastructure=athena), jobs are submitted in parallel to the cluster. Locally, they run sequentially via submitit_local.

Output directory structure

Results are written to <data_root>/outputs/<experiment_name>/<stage>/<date>/<time>/. Each run gets a unique timestamped directory. When using --multirun, an additional multirun/ level is added with per-job subdirectories.
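
For example, a capture run in an experiment named my_experiment might land in (date and time are illustrative):

<data_root>/outputs/my_experiment/capture/2025-01-15/10-30-00/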

Result inspector

A Streamlit app for browsing experiment results is available at scripts/inspector.py. It requires the representation stage to have been run first.

streamlit run scripts/inspector.py -- --base_dir <path_to_representation_output>

When running on PLGrid:

  1. On PLG: ./scripts/start-interactive-session.sh
  2. Wait for the session to start and note the node it is running on, e.g. t0048
  3. On your local machine: ssh -L 8888:t0048:8888 plg<username>@athena.cyfronet.pl
  4. On PLG: cd scripts
  5. On PLG: streamlit run inspector.py --server.port 8888 -- --base_dir <path_to_representation_output>
  6. On your local machine, open localhost:8888 in your browser
