This repository contains the code and benchmarks for PRISM, focusing on multimodal skin cancer diagnosis using the PAD-UFES-20 and MILK10k datasets.
Before running anything, update `config.py` to point to your local dataset copies. You must set the following paths correctly:

- `PAD_20_PATH` and `PAD_20_IMAGES_FOLDER`
- `MILK10K_PATH` and `MILK10K_TRAIN_IMAGES_FOLDER`
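As a minimal sketch, `config.py` might look like the following. The variable names come from this README, but every path is a placeholder; point them at your local copies and check whether each loader expects a metadata file or a directory:

```python
# config.py -- example values only; replace the placeholder paths below

# PAD-UFES-20
PAD_20_PATH = "/data/pad-ufes-20"                    # placeholder dataset root
PAD_20_IMAGES_FOLDER = "/data/pad-ufes-20/images"    # placeholder images folder

# MILK10k
MILK10K_PATH = "/data/milk10k"                       # placeholder dataset root
MILK10K_TRAIN_IMAGES_FOLDER = "/data/milk10k/train"  # placeholder train images folder
```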
Before running the experiments, you must generate the one-hot and sentence encodings for the metadata of the respective dataset.
```sh
# For PAD-UFES-20
python -m benchmarks.pad20.preprocess.onehot
python -m benchmarks.pad20.preprocess.sentence

# For MILK10k
python -m benchmarks.milk10k.preprocess.onehot
python -m benchmarks.milk10k.preprocess.sentence
```

You can run cross-validation on the feature fusion models using the `benchmarks.kfold` script.
```sh
python -m benchmarks.kfold {pad20|milk10k} [OPTIONS]
```

Options:

- `--all`: Runs all baseline methods (`no_metadata`, `cross_attention`, `remixformer`, and `metablock`).
- `--fusion <method>`: Runs a single, specific fusion method (defaults to `no_metadata`).
- `--priors <timestamp>`: Required when running the `naive_bayes` model. You must provide the timestamp of the results folder from the `no_metadata` baseline.
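For example (the timestamp below is hypothetical; use the one produced by your own `no_metadata` run):

```sh
# Run all baseline fusion methods on PAD-UFES-20
python -m benchmarks.kfold pad20 --all

# Run a single fusion method on MILK10k
python -m benchmarks.kfold milk10k --fusion cross_attention

# Run naive_bayes, reusing class priors from a previous no_metadata run
python -m benchmarks.kfold pad20 --fusion naive_bayes --priors 2024-01-15_10-30-00
```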
Note: Results are saved in the `benchmarks/{dataset}/results/{timestamp}` folder.
The Bayesian network approach is implemented separately, using Pyro.
First, run the Bayesian preprocessing script, passing the timestamp of the CNN `no_metadata` baseline results:

```sh
python -m benchmarks.{pad20|milk10k}.preprocess.bayesian --no-metadata-timestamp <timestamp>
```

Afterward, execute the cross-validation across all backbones:

```sh
python -m benchmarks.kfoldbayesian {pad20|milk10k}
```

To generate LaTeX tables and aggregate all results, ensure that the output directories for all fusion methods (e.g., `cross_attention`, `naive_bayes`, `bayesiannetwork`, etc.) are placed under a single `benchmarks/{dataset}/results/{timestamp}` directory.
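For instance, an aggregated results directory might look like this (the timestamp is hypothetical, and the exact subfolder names depend on which methods you ran):

```
benchmarks/pad20/results/2024-01-15_10-30-00/
├── no_metadata/
├── cross_attention/
├── remixformer/
├── metablock/
├── naive_bayes/
└── bayesiannetwork/
```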
Then run the metrics script:
```sh
python -m utils.metrics {pad20|milk10k} -t <timestamp>
```

This generates the LaTeX table with all metrics for all fusion methods and backbones.
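Continuing with the hypothetical timestamp from the layout above:

```sh
python -m utils.metrics pad20 -t 2024-01-15_10-30-00
```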
After aggregating the results under the same `results/{timestamp}` directory, you can perform statistical tests:

```sh
python -m utils.statistical {pad20|milk10k} -t <timestamp> [--control <method>]
```

Options:

- `--control`: Specify the control method for comparisons (defaults to `naive_bayes`).
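For example, to use `metablock` as the control method instead of the default (timestamp again hypothetical):

```sh
python -m utils.statistical pad20 -t 2024-01-15_10-30-00 --control metablock
```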
After aggregating the results for all methods under the same `results/{timestamp}` directory, you can run:

```sh
python -m benchmarks.pad20.incremental -t <timestamp>
```

Options:

- `--backbone`: Specifies the backbone (defaults to `efficientnet-b0`).
- `--cached`: Whether to use pre-computed results (defaults to `False`).
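For example, with a hypothetical timestamp (`--cached` is shown here as a boolean flag; adjust if the script expects an explicit value):

```sh
python -m benchmarks.pad20.incremental -t 2024-01-15_10-30-00 --backbone efficientnet-b0 --cached
```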
The generated figures are saved to `benchmarks/pad20/results/{timestamp}/incremental_metadata_performance_subplots_*.png`.
Recently, `sentence-transformers` changed its API and started displaying warnings like the following:

```
AlbertModel LOAD REPORT from: sentence-transformers/paraphrase-albert-small-v2
Key                     | Status     |  |
------------------------+------------+--+-
embeddings.position_ids | UNEXPECTED |  |

Notes:
- UNEXPECTED: can be ignored when loading from different task/architecture; not ok if you expect identical arch.
```

You can safely ignore them: according to this issue, the `position_ids` were "just an integer range tensor from 0 to the max sequence length. [...] there's not much point in saving this in the checkpoints, since it can easily be recomputed on the fly", hence the warning.