🧠BrainFLORA: Uncovering Brain Concept Representation via Multimodal Neural Embeddings

A comparative overview of multimodal decoding paradigms.

Overall architecture of BrainFLORA.

🐣 Update

2025/12/21, we released the preprocessed datasets and pretrained checkpoints on 🤗 Hugging Face.
2025/07/15, the arXiv paper became public.
2025/07/12, we officially released the code.
2025/07/05, BrainFLORA was accepted by ACM MM 2025.

🛠️ Environment Setup

Quick Start

Option 1: Using setup script (Recommended)

bash setup.sh
conda activate BrainFLORA

Option 2: Using conda environment file

conda env create -f environment.yml
conda activate BrainFLORA

Option 3: Using pip

pip install -r requirements.txt

Important: Install as editable package

After setting up the environment using any of the above options, install the project in editable mode to enable proper module imports:

pip install -e .
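
To confirm that the editable install took effect, a quick check along these lines may help. This is a minimal sketch, not part of the repository: the module names passed in are assumptions based on the top-level folder names (e.g. `model`, `utils`, `layers`), and which of them are importable depends on what `setup.py` exposes.

```python
import importlib.util


def check_importable(module_names):
    """Return a dict mapping each module name to True if Python can locate it."""
    results = {}
    for name in module_names:
        try:
            results[name] = importlib.util.find_spec(name) is not None
        except (ImportError, ValueError):
            # find_spec can raise for dotted names with a missing parent
            # package, or for modules with a broken __spec__.
            results[name] = False
    return results


if __name__ == "__main__":
    # Hypothetical module names; adjust to what setup.py actually installs.
    for name, ok in check_importable(["model", "utils", "layers"]).items():
        print(f"{name}: {'found' if ok else 'MISSING'}")
```

If a module is reported as missing, re-running `pip install -e .` from the repository root inside the activated environment is the first thing to try.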

📊 Dataset Preparation

Option 1: Using Preprocessed Data (Recommended)

We provide preprocessed datasets ready for training on Hugging Face:

from datasets import load_dataset
# Load the preprocessed BrainFLORA dataset
dataset = load_dataset("LidongYang/BrainFLORA")

👉 Download from Hugging Face

Option 2: Download Raw Data

To download and preprocess the raw data yourself:

Dataset	Download path
THINGS-EEG1	Download
THINGS-EEG2	Download
THINGS-MEG	Download
THINGS-fMRI	Download
THINGS-Images	Download

After downloading, use the preprocessing scripts in the data_preparing/ directory to process the raw data.

🚴‍♂️Quick Training

1. Visual Retrieval

We provide a script that trains the modality encoders jointly across subjects on the THINGS-EEG2 dataset. Update the dataset path to match your setup, then run:

python Retrieval/retrieval_joint_train_medformer.py --logger True --gpu cuda:0 --output_dir ./outputs/contrast

To replicate the results for the other modalities (e.g., MEG, fMRI), run:

# MEG
python Retrieval/retrieval_joint_train_MEG_medformer.py --logger True --gpu cuda:0 --output_dir ./outputs/contrast

# fMRI
python Retrieval/retrieval_joint_train_fMRI_medformer.py --logger True --gpu cuda:0 --output_dir ./outputs/contrast
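
To run the three retrieval trainings back to back, a small launcher like the one below could be used. It is a sketch under stated assumptions: the script paths and flags are copied from the commands above, while the helper names (`build_command`, `run_all`) and the `dry_run` switch are hypothetical and not part of the repository.

```python
import shlex
import subprocess
import sys

# Script paths as listed in this README, keyed by modality.
RETRIEVAL_SCRIPTS = {
    "eeg": "Retrieval/retrieval_joint_train_medformer.py",
    "meg": "Retrieval/retrieval_joint_train_MEG_medformer.py",
    "fmri": "Retrieval/retrieval_joint_train_fMRI_medformer.py",
}


def build_command(modality, gpu="cuda:0", output_dir="./outputs/contrast"):
    """Build the argument list for one retrieval training run."""
    script = RETRIEVAL_SCRIPTS[modality.lower()]
    return [
        sys.executable, script,
        "--logger", "True",
        "--gpu", gpu,
        "--output_dir", output_dir,
    ]


def run_all(modalities, dry_run=True):
    """Print each command and, unless dry_run is set, execute it in order."""
    for modality in modalities:
        cmd = build_command(modality)
        print(shlex.join(cmd))
        if not dry_run:
            # check=True stops the sequence if a training run fails.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(["eeg", "meg", "fmri"], dry_run=True)
```

Run it from the repository root so the relative script paths resolve, and set `dry_run=False` once the printed commands look right.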

2. Visual Reconstruction

We provide quick training and inference scripts for the high-level and low-level pipelines of visual reconstruction. Update the dataset path to match your setup, then run:

# Train and get multimodal neural embeddings aligned with CLIP embedding:
python train/train_unified_encoder_highlevel_diffprior.py \
    --modalities eeg meg fmri \
    --gpu cuda:0 \
    --output_dir ./outputs/contrast

3. Visual Captioning

We provide scripts for visual caption generation:

# Train feature adapter with caption support
python train/train_unified_encoder_highlevel_diffprior_caption.py \
    --modalities eeg meg fmri \
    --gpu cuda:0 \
    --output_dir ./outputs/contrast

4. Distributed Training (Multi-GPU)

For multi-GPU training with accelerate:

accelerate launch train/train_unified_encoder_highlevel_diffprior_parallel.py \
    --modalities eeg meg fmri \
    --output_dir ./outputs/contrast

🚴‍♂️Quick Evaluation

1. Visual Retrieval

We provide a notebook for evaluating the models. Open it in Jupyter and run its cells:

cd eval/
FLORA_inference.ipynb
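
For readers unfamiliar with retrieval metrics, the sketch below shows one common way to score retrieval: top-k accuracy over cosine similarities between neural embeddings and candidate image embeddings. It is an illustration only and assumes that row i of the neural embeddings corresponds to image i; it is not necessarily how `FLORA_inference.ipynb` computes its numbers, and the function names are hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two non-zero vectors given as sequences."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def top_k_accuracy(neural_embeddings, image_embeddings, k=1):
    """Fraction of trials whose matching image (same index) ranks in the top k."""
    hits = 0
    for i, query in enumerate(neural_embeddings):
        sims = [cosine_similarity(query, img) for img in image_embeddings]
        # Candidate indices sorted from most to least similar.
        ranked = sorted(range(len(sims)), key=lambda j: sims[j], reverse=True)
        if i in ranked[:k]:
            hits += 1
    return hits / len(neural_embeddings)


if __name__ == "__main__":
    # Toy 2-D embeddings, made up for illustration.
    neural = [[1.0, 0.1], [0.9, 0.2], [0.0, 1.0]]
    images = [[1.0, 0.0], [0.7, 0.7], [0.0, 1.0]]
    print("top-1:", top_k_accuracy(neural, images, k=1))
    print("top-2:", top_k_accuracy(neural, images, k=2))
```

In the toy data the second query is closer to the first image than to its own, so it counts as a miss at k=1 but a hit at k=2.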

2. Visual Reconstruction

# Reconstruct images by assigning modalities and subjects:
cd eval/
python FLORA_inference_reconst.py

3. Visual Captioning

# Get captions from the prior latents (open the notebook in Jupyter and run its cells)
cd eval/
FLORA_inference_caption.ipynb

👍 Citations

If you find our work useful, please consider citing:

@inproceedings{li2025brainflora,
  author = {Li, Dongyang and Qin, Haoyang and Wu, Mingyang and Wei, Chen and Liu, Quanying},
  title = {BrainFLORA: Uncovering Brain Concept Representation via Multimodal Neural Embeddings},
  year = {2025},
  isbn = {9798400720352},
  publisher = {Association for Computing Machinery},
  address = {New York, NY, USA},
  url = {https://doi.org/10.1145/3746027.3754996},
  doi = {10.1145/3746027.3754996},
  booktitle = {Proceedings of the 33rd ACM International Conference on Multimedia},
  pages = {5577--5586}
}

@article{li2024visual,
  title={Visual Decoding and Reconstruction via EEG Embeddings with Guided Diffusion},
  author={Li, Dongyang and Wei, Chen and Li, Shiying and Zou, Jiachen and Liu, Quanying},
  journal={Advances in Neural Information Processing Systems},
  volume={37},
  pages={102822--102864},
  year={2024}
}

@inproceedings{wei2024cocog,
  title={CoCoG: controllable visual stimuli generation based on human concept representations},
  author={Wei, Chen and Zou, Jiachen and Heinke, Dietmar and Liu, Quanying},
  booktitle={Proceedings of the Thirty-Third International Joint Conference on Artificial Intelligence},
  pages={3178--3186},
  year={2024}
}

😺Acknowledge

1. We thank Y. Song et al. for their contributions to dataset preprocessing and neural network architecture; we build on their work: "Decoding Natural Images from EEG for Object Recognition". Yonghao Song, Bingchuan Liu, Xiang Li, Nanlin Shi, Yijun Wang, and Xiaorong Gao.

2. We also thank the authors of SDRecon for providing their code and results. Some parts of the training script are based on MindEye and MindEye2. Thanks for these excellent research works.

3. We use the THINGS-EEG2 dataset introduced in the paper: "A large and rich EEG dataset for modeling human visual object recognition". Alessandro T. Gifford, Kshitij Dwivedi, Gemma Roig, and Radoslaw M. Cichy.

4. The THINGS-MEG and THINGS-fMRI datasets we use are described in: "THINGS-data, a multimodal collection of large-scale datasets for investigating object representations in human brain and behavior". Martin N. Hebart, Oliver Contier, Lina Teichmann, Adam H. Rockter, Charles Y. Zheng, Alexis Kidder, Anna Corriveau, Maryam Vaziri-Pashkam, and Chris I. Baker.

5. We use "BrainHub" for visual caption evaluation, from "UMBRAE: Unified Multimodal Brain Decoding" (ECCV 2024). Weihao Xia, Raoul de Charette, Cengiz Oztireli, and Jing-Hao Xue.

Contact Dongyang Li if you have any questions or suggestions.

🏷️ License

This repository is released under the MIT license. See LICENSE for additional details.
