API Reference

MediaRef(uri: str | DataURI, pts_ns: int | None = None)

Properties: is_embedded, is_video, is_remote, is_relative_path

Methods:

to_ndarray(format="rgb") -> np.ndarray - Load as numpy array
- Formats: "rgb" (default), "bgr", "rgba", "bgra", "gray"
- Returns: (H, W, 3) for RGB/BGR, (H, W, 4) for RGBA/BGRA, (H, W) for grayscale
to_pil_image(format="rgb") -> PIL.Image - Load as PIL Image
- Formats: "rgb" (default), "rgba", "gray"
resolve_relative_path(base_path, on_unresolvable="warn") -> MediaRef - Resolve relative paths
- on_unresolvable: How to handle embedded/remote URIs: "error", "warn" (default), or "ignore"
validate_uri() -> bool - Check if URI exists (local files only)
model_dump() -> dict - Serialize to dict
model_dump_json() -> str - Serialize to JSON
model_validate(data) -> MediaRef - Deserialize from dict
model_validate_json(json_str) -> MediaRef - Deserialize from JSON

Class Methods:

Methods:

to_ndarray(format="rgb") -> np.ndarray - Convert to numpy array
- Formats: "rgb" (default), "bgr", "rgba", "bgra", "gray"
to_pil_image() -> PIL.Image - Convert to PIL Image

Properties:

batch_decode(refs, decoder="pyav") -> list[np.ndarray] - Batch decode video frames
- refs: List of MediaRef objects to decode
- decoder: Decoder backend ("pyav" or "torchcodec")
cleanup_cache() - Clear video container cache (PyAV only)

Both decoders follow the same playback semantics, ensuring consistent frame selection regardless of backend.

PyAVVideoDecoder(source) - PyAV-based decoder
- CPU-based decoding using FFmpeg
- Automatic container caching with reference counting
TorchCodecVideoDecoder(source) - TorchCodec-based decoder
- Requires torchcodec (install separately)
- GPU-accelerated decoding with CUDA support

Decoder Comparison: