|
3 | 3 | > [!WARNING] |
4 | 4 | > **Work in Progress**: This project is under active development. Core architectures, CLI flags, and data formats are subject to major changes. |
5 | 5 |
|
6 | | -**SpatialTranscriptFormer** bridges histology and biological pathways through a high-performance transformer architecture. By modeling the dense interplay between morphological features and gene expression signatures, it provides an interpretable and spatially-coherent mapping of the tissue microenvironment. |
| 6 | +> [!TIP] |
| 7 | +> **Framework Release**: SpatialTranscriptFormer has been restructured from a research codebase into a robust framework. You can now use the Python API to train on your own spatial transcriptomics data with custom backbones and architectures. |
| 8 | +
|
| 9 | +**SpatialTranscriptFormer** is a modular deep learning framework designed to bridge histology and biological pathways. It leverages transformer architectures to model the interplay between morphological features and gene expression signatures, providing interpretable mapping of the tissue microenvironment. |
7 | 10 |
|
8 | 11 | ## Key Technical Pillars |
9 | 12 |
|
@@ -35,13 +38,13 @@ This project requires [Conda](https://docs.conda.io/en/latest/). |
35 | 38 | 1. Clone the repository. |
36 | 39 | 2. Run the automated setup script: |
37 | 40 | 3. On Windows: `.\setup.ps1` |
38 | | - - On Linux/HPC: `bash setup.sh` |
| 41 | +4. On Linux/HPC: `bash setup.sh` |
39 | 42 |
|
40 | | -## Usage |
| 43 | +## Usage: HEST-1k Benchmark Recipe |
41 | 44 |
|
42 | | -### Dataset Access |
| 45 | +While the core `SpatialTranscriptFormer` framework can be integrated programmatically with any dataset (see the **[Python API Reference](docs/API.md)** and **[Bring Your Own Data Guide](src/spatial_transcript_former/recipes/custom/README.md)**), this repository includes a complete, out-of-the-box CLI pipeline specifically for reproducing our benchmarks on the [HEST-1k dataset](https://huggingface.co/datasets/MahmoodLab/hest). |
43 | 46 |
|
44 | | -The model uses the **HEST1k** dataset. You can download specific subsets (by organ, technology, etc.) or the entire dataset using the `stf-download` utility: |
| 47 | +### Dataset Access |
45 | 48 |
|
46 | 49 | ```bash |
47 | 50 | # List available filtering options |
@@ -94,10 +97,19 @@ Visualization plots will be saved to the `./results` directory. |
94 | 97 |
|
95 | 98 | ## Documentation |
96 | 99 |
|
97 | | -- [Models](docs/MODELS.md): Detailed model architectures and scaling parameters. |
98 | | -- [Data Structure](docs/DATA_STRUCTURE.md): Organization of HEST data on disk. |
99 | | -- [Pathway Mapping](docs/PATHWAY_MAPPING.md): Clinical interpretability and pathway integration. |
100 | | -- [Gene Analysis](docs/GENE_ANALYSIS.md): Modeling strategies for high-dimensional gene space. |
| 100 | +### Framework APIs & Usage |
| 101 | + |
| 102 | +- **[Python API Reference](docs/API.md)**: Full documentation for `Trainer`, `Predictor`, and `SpatialDataset`. |
| 103 | +- **[Bring Your Own Data Guide](src/spatial_transcript_former/recipes/custom/README.md)**: Templates and examples for training on your own non-HEST spatial transcriptomics data. |
| 104 | +- **[HEST Recipe Docs](src/spatial_transcript_former/recipes/hest/README.md)**: Detailed documentation for the included HEST-1k dataset recipe. |
| 105 | +- **[Training Guide](docs/TRAINING_GUIDE.md)**: Complete list of configuration flags and preset configurations for HEST models. |
| 106 | + |
| 107 | +### Theory & Interpretability |
| 108 | + |
| 109 | +- **[Models & Architecture](docs/MODELS.md)**: Deep dive into the quad-flow interaction logic and network scaling. |
| 110 | +- **[Pathway Mapping](docs/PATHWAY_MAPPING.md)**: Clinical interpretability, pathway bottleneck design, and MSigDB integration. |
| 111 | +- **[Gene Analysis](docs/GENE_ANALYSIS.md)**: Modeling strategies for mapping morphology to high-dimensional gene spaces. |
| 112 | +- **[Data Structure](docs/DATA_STRUCTURE.md)**: Detailed breakdown of the HEST data structure on disk, metadata conventions, and preprocessing invariants. |
101 | 113 |
|
102 | 114 | ## Development |
103 | 115 |
|
|
0 commit comments