FlashOCC v2.0

An improved engineering version of FlashOCC, removing mmcv/mmdet3d dependencies, supporting the latest torch and CUDA devices.

Features

Fixed BGR color channel issue: HuangJunJie2017/BEVDet#274
New BEV Pool V3 with torch.compile support, significantly improving performance.
DAIL data pipeline.

Request for Assistance!

Currently, we are unable to reproduce the FlashOCC_ResNet50 results (mIoU 32.08). We haven't had the bandwidth to investigate the cause yet. If you have any findings, please share them with us. Thank you in advance!

Note: @Yzichen is not involved in the maintenance of this project. His name appears due to early forks and compressed commit history, but this does not diminish his original contributions!

Quick Start

# Installation
uv venv .venv --python 3.12 && source .venv/bin/activate
uv sync

# Prepare Data (see doc/nuscenes_data.md)
python tools/create_data_flashocc2.py

# Training (Single GPU)
python tools/train.py configs/flashocc_r50.py

# Training (Multi-GPU)
torchrun --nproc_per_node=4 tools/train.py configs/flashocc_r50.py

# Testing
python tools/test.py configs/flashocc_r50.py work_dirs/flashocc_r50/epoch_24.pth --eval occ

# Dataset Class Distribution Statistics
python tools/analyze_class_distribution.py data/flashocc2-nuscenes_infos_train.pkl --no-show

Project Structure

FlashOCC/
├── configs/                  # Python configuration files
│   └── flashocc_r50.py       # R50 single-frame configuration
├── src/flashocc/             # Source code package
│   ├── constants.py          # Global constants (class names, grid parameters, etc.)
│   ├── config/               # Configuration system (Lazy descriptors, data classes)
│   ├── core/                 # Core infrastructure
│   │   ├── base_module.py    # BaseModule (supports init_cfg)
│   │   ├── checkpoint.py     # Weight loading/saving
│   │   ├── dist.py           # Distributed utilities
│   │   ├── fp16.py           # Mixed precision (force_fp32, wrap_fp16_model)
│   │   ├── functional.py     # multi_apply, reduce_mean
│   │   ├── nn.py             # ConvModule, build_conv/norm_layer, initialization
│   │   ├── registry.py       # Registry pattern
│   │   ├── bbox/             # 3D bounding boxes & point classes
│   │   └── ops/              # CUDA operators (bev_pool_v2, bev_pool_v3)
│   ├── datasets/             # Data loading and evaluation
│   │   ├── base_dataset.py   # Custom3DDataset base class
│   │   ├── nuscenes_occ.py   # NuScenesOccDataset (occupancy evaluation)
│   │   ├── nuscenes_bevdet.py# NuScenesDatasetBEVDet (detection evaluation)
│   │   ├── pipelines/        # Data pipelines (loading, augmentation, formatting)
│   │   └── evaluation/       # Evaluation metrics (mIoU, RayIoU, RayPQ)
│   ├── engine/               # Training and inference
│   │   ├── trainer.py        # Training loop
│   │   ├── tester.py         # Testing loop
│   │   ├── inference.py      # single_gpu_test
│   │   ├── seed.py           # Seed utilities
│   │   ├── parallel.py       # DataParallel wrapper
│   │   └── hooks/            # Training hooks (EMA, SyncBN, etc.)
│   └── models/               # Model definitions
│       ├── backbones/        # ResNet, CustomResNet
│       ├── necks/            # FPN, LSSFPN, ViewTransformer
│       ├── heads/            # OccHead
│       ├── detectors/        # BEVDet → BEVDetOCC pipeline
│       └── losses/           # Cross-entropy loss utilities
├── tools/                    # Command-line scripts
│   ├── train.py              # Training entry point
│   ├── test.py               # Testing entry point
│   ├── analyze_class_distribution.py  # Class distribution analysis
│   ├── dist_train.sh         # Multi-GPU training script
│   ├── dist_test.sh          # Multi-GPU testing script
│   └── create_data_flashocc2.py # Data preparation
├── data/nuscenes/            # Dataset (not tracked by git)
├── ckpts/                    # Pre-trained weights
└── pyproject.toml            # Dependencies and build configuration

Configuration System

Configuration files are pure Python, using Lazy descriptors and data classes—no YAML, no dictionary-based registries:

# configs/flashocc_r50.py
from flashocc.config import Experiment, Lazy, DataConfig, GridConfig
from flashocc.models.backbones.resnet import ResNet
from flashocc.models.necks.fpn import FPN

experiment = Experiment(
    model=Lazy(BEVDetOCC,
        img_backbone=Lazy(ResNet, depth=50, ...),
        img_neck=Lazy(FPN, ...),
        ...
    ),
    data=DataConfig(...),
    ...
)

All model components are referenced via Python import directly—IDE navigation, refactoring, and type checking work out of the box.

Benchmarks

Model	Backbone	Input Size	mIoU	Parameters
FlashOCC M1 (1f)	R50	256×704	32.08	44.74M
FlashOCC-4D-Stereo (2f)	R50	256×704	37.84	-
FlashOCC-4D-Stereo (2f)	Swin-B	512×1408	43.52	144.99M

Documentation

Citation

@article{yu2024ultimatedo,
  title={UltimateDO: An Efficient Framework to Marry Occupancy Prediction with 3D Object Detection via Channel2height},
  author={Yu, Zichen and Shu, Changyong},
  journal={arXiv preprint arXiv:2409.11160},
  year={2024}
}

@article{yu2024panoptic,
  title={Panoptic-FlashOcc: An Efficient Baseline to Marry Semantic Occupancy with Panoptic via Instance Center},
  author={Yu, Zichen and Shu, Changyong and Sun, Qianpu and Linghu, Junjie and Wei, Xiaobao and Yu, Jiangyong and Liu, Zongdai and Yang, Dawei and Li, Hui and Chen, Yan},
  journal={arXiv preprint arXiv:2406.10527},
  year={2024}
}

@article{yu2023flashocc,
  title={FlashOcc: Fast and Memory-Efficient Occupancy Prediction via Channel-to-Height Plugin},
  author={Zichen Yu and Changyong Shu and Jiajun Deng and Kangjie Lu and Zongdai Liu and Jiangyong Yu and Dawei Yang and Hui Li and Yan Chen},
  year={2023},
  eprint={2311.12058},
  archivePrefix={arXiv},
  primaryClass={cs.CV}
}

Acknowledgements

Based on BEVDet, FB-BEV, RenderOcc, and SparseBEV.

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
configs		configs
doc		doc
src/flashocc		src/flashocc
third_party		third_party
tools		tools
.gitignore		.gitignore
.gitmodules		.gitmodules
.python-version		.python-version
LICENSE		LICENSE
README.md		README.md
README_CN.md		README_CN.md
TODO.md		TODO.md
plan.md		plan.md
plan_f.md		plan_f.md
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

FlashOCC v2.0

Features

Request for Assistance!

Quick Start

Project Structure

Configuration System

Benchmarks

Documentation

Citation

Acknowledgements

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

FlashOCC v2.0

Features

Request for Assistance!

Quick Start

Project Structure

Configuration System

Benchmarks

Documentation

Citation

Acknowledgements

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages