AV1-Optical-Flow

Extract and pre-process motion vectors from AV1 bitstreams for fast and cheap optical flow estimation.

AV1 Motion Vector Fidelity and Application for Efficient Optical Flow
Julien Zouein, Vibhoothi, Anil Kokaram
Picture Coding Symposium (PCS) 2025

extract_motion_small.mp4

Overview

AV1-Optical-Flow extracts the motion vectors that the AV1 encoder already computes during compression and converts them into standard optical flow fields (.flo5 format). Because the motion information is a free by-product of video encoding, this approach is orders of magnitude faster than running a dedicated optical flow network, making it suitable for real-time or large-scale video analysis pipelines.

The pipeline:

1. Decodes the AV1 bitstream with the AOM inspect tool, which dumps per-frame metadata (motion vectors, reference maps, block modes) to JSON.
2. Parses the raw bitstream to extract reference frame order hints, converting AV1's internal reference indices into actual frame numbers.
3. Post-processes the motion vectors with optional linear interpolation, upscaling, and bidirectional filling.
4. Writes the result as .flo5 (HDF5-compressed) flow files.

Requirements

Python 3.10+
uv (Python package manager)
CMake and a C compiler (for building AOM)
libvmaf (for building AOM with VMAF tuning)

Installation

# Clone the repository
git clone https://github.com/sigmedia/AV1-Optical-Flow.git
cd AV1-Optical-Flow

# Run the setup script (installs Python deps via uv + builds AOM from source)
bash setup.sh

The setup script will:

Run uv sync to create a virtual environment and install all dependencies from pyproject.toml
Clone and build AOM with the inspection API enabled (CONFIG_INSPECTION=1)
Install pre-commit hooks

If you don't have uv installed yet:

curl -LsSf https://astral.sh/uv/install.sh | sh

Usage

The input must be an IVF-wrapped AV1 file.

uv run python main.py \
    --input_file path/to/video.ivf \
    --output_directory path/to/output/
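
Since the tool only accepts IVF-wrapped AV1, it can help to check a file before running the pipeline. The following is a minimal sketch of such a check, based on the standard 32-byte IVF file header; `read_ivf_header` is a hypothetical helper written for illustration and is not necessarily how the project's own IVF validation in `utils.py` works.

```python
import struct

# IVF file header: 32 bytes, little-endian.
# signature, version, header length, fourcc, width, height,
# timebase denominator, timebase numerator, frame count, unused
IVF_HEADER = struct.Struct("<4sHH4sHHIIII")


def read_ivf_header(data: bytes) -> dict:
    """Parse an IVF file header and check that the payload is AV1.

    Hypothetical helper for illustration only.
    """
    if len(data) < IVF_HEADER.size:
        raise ValueError("file too short to contain an IVF header")
    (signature, version, header_len, fourcc, width, height,
     timebase_den, timebase_num, frame_count, _unused) = IVF_HEADER.unpack_from(data)
    if signature != b"DKIF":
        raise ValueError("not an IVF file (missing DKIF signature)")
    if fourcc != b"AV01":
        raise ValueError(f"IVF payload is {fourcc!r}, expected b'AV01'")
    return {
        "version": version,
        "header_len": header_len,
        "width": width,
        "height": height,
        "timebase_den": timebase_den,
        "timebase_num": timebase_num,
        "frame_count": frame_count,
    }
```

To use it, read the first 32 bytes of the file in binary mode and pass them to `read_ivf_header`; a `ValueError` means the file is not an IVF container or does not carry AV1.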

Options

| Flag | Description |
| --- | --- |
| `--input_file` | Path to the input AV1 `.ivf` file. |
| `--output_directory` | Directory where `.flo5` flow files are written. |
| `--linear_interpolation` | Normalise motion vectors by temporal distance to the reference frame. |
| `--upscale_function` | Upscale the motion field to frame resolution. Choices: `bicubic`, `nearest`, `bilinear`, `area`, `lanczos`. |
| `--bidirectional_filling` | Fill zero-motion intra blocks using the motion from the opposite direction. |
| `--logger_level` | Logging verbosity. Choices: `DEBUG`, `INFO`, `WARNING`, `ERROR`, `CRITICAL`. Default: `INFO`. |
| `--version` | Print version information and exit. |

Example

Extract motion vectors with linear interpolation and bicubic upscaling:

uv run python main.py \
    --input_file input.ivf \
    --output_directory ./flows/ \
    --linear_interpolation \
    --upscale_function bicubic \
    --bidirectional_filling

This produces two files per frame in the output directory:

motion_backward_<N>.flo5 — backward motion field (current frame to past reference)
motion_forward_<N>.flo5 — forward motion field (current frame to future reference)
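
Downstream code usually needs the backward and forward files for the same frame together. The sketch below groups the output files by frame index using only the naming scheme above. It assumes `<N>` is a plain decimal integer (the padding is not specified here), and `pair_flow_files` is a hypothetical helper, not part of the project.

```python
import re
from pathlib import Path

# Matches the two file names described above; <N> is assumed to be decimal.
FLOW_NAME = re.compile(r"motion_(backward|forward)_(\d+)\.flo5$")


def pair_flow_files(output_directory):
    """Group flow files by frame index.

    Returns {N: {"backward": Path, "forward": Path}}, sorted by N.
    A direction is simply absent if its file does not exist.
    """
    pairs = {}
    for path in Path(output_directory).glob("motion_*.flo5"):
        match = FLOW_NAME.match(path.name)
        if match is None:
            continue  # some other file that happens to share the prefix
        direction, index = match.group(1), int(match.group(2))
        pairs.setdefault(index, {})[direction] = path
    return dict(sorted(pairs.items()))
```

Sorting on the integer index rather than the file name avoids `motion_backward_10.flo5` being ordered before `motion_backward_2.flo5`.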

Development

# Install dev dependencies
uv sync --group dev

# Run linter
uv run ruff check .

# Run type checker
uv run pyright

# Run pre-commit on all files
uv run pre-commit run --all-files

Project Structure

AV1-Optical-Flow/
├── main.py                          # Entry point: orchestrates the full pipeline
├── pyproject.toml                   # Project metadata and dependencies (uv)
├── setup.sh                         # Builds AOM and installs dependencies via uv
├── src/
│   └── modules/
│       ├── av1_parser.py            # Pure-Python AV1 bitstream parser (order hints)
│       ├── json_processing.py       # Processes AOM inspect JSON (motion vectors)
│       ├── flow_io.py               # Read/write optical flow in multiple formats
│       ├── utils.py                 # Upscaling, bidirectional filling, IVF validation
│       └── logger.py                # Logging configuration
└── assets/
    └── extract_motion.mp4          # Demo video

How It Works

Motion Vector Extraction

AV1 prediction blocks range from 4x4 to 128x128 pixels, and motion information is stored on a grid of 4x4-pixel units. Each inter-coded block references one or two previously decoded frames and carries a 2D motion vector per reference. The AOM inspect tool exposes these as a JSON array of shape (H/4, W/4, 4): two components each for the backward and forward references.
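
To make the array layout concrete, here is a small NumPy sketch that computes the block-grid size for a frame and splits a (H/4, W/4, 4) array into two 2-channel fields. The channel order (backward in channels 0-1, forward in channels 2-3) is an assumption made for illustration, as is rounding the grid up for frame sizes that are not multiples of 4; both function names are hypothetical.

```python
import numpy as np

BLOCK = 4  # motion vectors are reported on a 4x4-pixel grid


def block_grid_shape(height: int, width: int, block: int = BLOCK) -> tuple[int, int]:
    """Rows and columns of the block-level motion field.

    Rounds up for sizes that are not a multiple of the block size (assumption).
    """
    return (-(-height // block), -(-width // block))


def split_motion_field(mv: np.ndarray) -> tuple[np.ndarray, np.ndarray]:
    """Split a (rows, cols, 4) array into backward and forward (rows, cols, 2) fields.

    Assumes channels 0-1 hold the backward vector and 2-3 the forward vector.
    """
    if mv.ndim != 3 or mv.shape[2] != 4:
        raise ValueError(f"expected shape (rows, cols, 4), got {mv.shape}")
    backward = mv[..., 0:2]
    forward = mv[..., 2:4]
    return backward, forward
```

For a 1920x1080 frame this gives a 270x480 grid, so each field holds one vector per 4x4 block before any upscaling.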

Order Hint Parsing

AV1 identifies reference frames by type (LAST, GOLDEN, BWDREF, etc.), not by frame number. To convert these into actual frame numbers — needed for temporal normalisation — the pipeline includes a pure-Python AV1 bitstream parser (av1_parser.py) that reads the IVF container and extracts the order_hint and ref_frame_idx fields from each frame header, tracking the 8-slot reference buffer across the entire sequence.
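
The buffer tracking described above can be sketched in a few lines. This is an illustration rather than the code in `av1_parser.py`: `FrameHeader` is a hypothetical container for already-parsed header fields, `refresh_frame_flags` is the AV1 syntax element that says which slots a frame overwrites, and the default of 7 order-hint bits is an assumption (the real value is signalled in the sequence header). The wrap-around distance follows the `get_relative_dist` process of the AV1 specification.

```python
from dataclasses import dataclass

NUM_REF_SLOTS = 8
REF_NAMES = ("LAST", "LAST2", "LAST3", "GOLDEN", "BWDREF", "ALTREF2", "ALTREF")


@dataclass
class FrameHeader:  # hypothetical container for parsed header fields
    order_hint: int
    ref_frame_idx: tuple      # one slot index per name in REF_NAMES; empty for intra frames
    refresh_frame_flags: int  # bit i set -> slot i is overwritten by this frame


def get_relative_dist(a: int, b: int, order_hint_bits: int) -> int:
    """Signed distance a - b between two order hints, with wrap-around."""
    diff = a - b
    m = 1 << (order_hint_bits - 1)
    return (diff & (m - 1)) - (diff & m)


def reference_distances(headers, order_hint_bits: int = 7):
    """For each frame, map reference type -> signed temporal distance.

    Positive means the reference is displayed earlier than the current frame,
    negative means it is displayed later.
    """
    slots = [None] * NUM_REF_SLOTS  # order hint currently stored in each slot
    result = []
    for hdr in headers:
        dists = {}
        for name, slot in zip(REF_NAMES, hdr.ref_frame_idx):
            ref_hint = slots[slot]
            if ref_hint is not None:
                dists[name] = get_relative_dist(hdr.order_hint, ref_hint, order_hint_bits)
        result.append(dists)
        # Update the buffer only after the frame's own references are resolved.
        for i in range(NUM_REF_SLOTS):
            if (hdr.refresh_frame_flags >> i) & 1:
                slots[i] = hdr.order_hint
    return result
```

Resolving references before applying the refresh matters: a frame may overwrite the very slot it predicts from, and its distance must be computed against the old contents.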

Post-Processing

| Step | Description |
| --- | --- |
| Linear interpolation | Divides each motion vector by the temporal distance to its reference frame, normalising to a per-frame displacement. |
| Bidirectional filling | Copies the negated motion vector from the opposite direction into blocks that have zero motion (intra blocks). |
| Upscaling | Resizes the block-level motion field to full frame resolution using OpenCV interpolation. |
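
The three steps can be sketched with NumPy as follows. This is a simplified illustration under stated assumptions, not the project's implementation: the distance is taken as an absolute frame count with zero meaning "no reference", a block counts as empty when both vector components are zero, and the upscaling uses plain block repetition as a NumPy-only stand-in for the `nearest` option rather than OpenCV. All function names are hypothetical.

```python
import numpy as np


def linear_interpolation(mv: np.ndarray, distance: np.ndarray) -> np.ndarray:
    """Divide each vector by the temporal distance to its reference.

    mv has shape (rows, cols, 2); distance has shape (rows, cols), in frames.
    Blocks with distance 0 (no reference, by assumption) are left unchanged.
    """
    distance = np.abs(np.asarray(distance, dtype=np.float32))
    safe = np.where(distance == 0, 1.0, distance)
    return mv.astype(np.float32) / safe[..., None]


def bidirectional_fill(backward: np.ndarray, forward: np.ndarray):
    """Fill zero-motion blocks with the negated vector from the other direction."""
    bwd, fwd = backward.copy(), forward.copy()
    bwd_empty = ~backward.any(axis=-1)  # both components are zero
    fwd_empty = ~forward.any(axis=-1)
    # Read from the unmodified inputs so one fill cannot feed the other.
    bwd[bwd_empty] = -forward[bwd_empty]
    fwd[fwd_empty] = -backward[fwd_empty]
    return bwd, fwd


def upscale_nearest(mv: np.ndarray, block: int = 4) -> np.ndarray:
    """Repeat each block-level vector over its block (nearest-neighbour stand-in).

    Vector values are not rescaled, assuming they are already expressed in
    full-resolution pixel units.
    """
    return np.repeat(np.repeat(mv, block, axis=0), block, axis=1)
```

Blocks that are empty in both directions stay at zero after filling, since the negation of a zero vector is still zero.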

License

This project is licensed under the GNU Affero General Public License v3.0.

Copyright (C) 2026  Sigmedia.tv / Julien Zouein (zoueinj@tcd.ie)

Citation

@inproceedings{inproceedings,
author = {Zouein, Julien and Vibhoothi, Vibhoothi and Kokaram, Anil},
year = {2025},
month = {12},
pages = {1-5},
title = {AV1 Motion Vector Fidelity and Application for Efficient Optical Flow},
doi = {10.1109/PCS65673.2025.11417638}
}
