dct-io

Read and write the hidden numbers inside JPEG files — without touching the pixels.

What even is a JPEG?

When you save a photo as a JPEG, your computer doesn't store every pixel's colour directly. Instead it chops the image into tiny 8×8 pixel tiles and runs a maths trick called the Discrete Cosine Transform (DCT) on each one. Think of it like describing a song with a list of bass, mid, and treble levels rather than writing out every sound wave. The result is a list of 64 numbers per tile — the DCT coefficients — that capture the image's frequencies from coarse (the big shapes) to fine (tiny details).

Your photo (e.g. 480×320 pixels)
┌──────────────────────────────────────┐
│ ┌──┬──┬──┬──┬──┬──┬──┐              │
│ │  │  │  │  │  │  │  │  ...         │
│ ├──┼──┼──┼──┼──┼──┼──┤              │
│ │  │  │  │  │  │  │  │  ...  each □ │
│ ├──┼──┼──┼──┼──┼──┼──┤  = 8×8 pixels│
│ │  │  │  │  │  │  │  │  ...         │
│ └──┴──┴──┴──┴──┴──┴──┘              │
│         ... more rows ...            │
└──────────────────────────────────────┘
  DCT runs on each tile independently
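
For the curious, the transform itself fits in a few lines. The following is a naive sketch of the 8×8 DCT-II formula that JPEG uses, written for clarity rather than speed. It is not code from dct-io (which never computes a DCT), and `dct_8x8` is a name made up for illustration.

```rust
use std::f64::consts::PI;

/// Naive 2-D DCT-II of one 8×8 tile (illustrative helper, not part of dct-io).
/// Input: samples, normally level-shifted (pixel - 128).
/// Output: coefficients indexed [vertical freq][horizontal freq], not zigzag.
fn dct_8x8(tile: &[[f64; 8]; 8]) -> [[f64; 8]; 8] {
    let mut out = [[0.0; 8]; 8];
    for u in 0..8 {
        for v in 0..8 {
            // Correlate the tile with one cosine pattern per (u, v) pair.
            let mut sum = 0.0;
            for x in 0..8 {
                for y in 0..8 {
                    sum += tile[x][y]
                        * ((2 * x + 1) as f64 * u as f64 * PI / 16.0).cos()
                        * ((2 * y + 1) as f64 * v as f64 * PI / 16.0).cos();
                }
            }
            // Normalisation factors from the JPEG definition of the DCT.
            let cu = if u == 0 { 1.0 / 2f64.sqrt() } else { 1.0 };
            let cv = if v == 0 { 1.0 / 2f64.sqrt() } else { 1.0 };
            out[u][v] = 0.25 * cu * cv * sum;
        }
    }
    out
}
```

A completely flat tile has no detail, so only `out[0][0]` (the DC term) is non-zero; with this normalisation it equals 8 times the sample value.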

Those numbers are then divided by per-frequency step sizes (the quantization table) and rounded to whole numbers (this is the "lossy" part), then packed tightly using Huffman coding, a compression trick that assigns shorter codes to more common values.

pixels     →   [DCT]   →  coefficients  →  [quantize]  →  integers  →  [Huffman]  →  .jpg
(8×8 tile)              (64 frequencies)   (÷ and round)  (64 ints)                   file
                                                              ▲            ▲
                                                        dct-io works  dct-io reads
                                                         with these   and writes here
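
To make the lossy step concrete, here is a minimal sketch of quantizing a single coefficient and undoing it again. The function names and the step size of 16 are illustrative assumptions; dct-io itself only hands you the already-quantized integers.

```rust
/// Quantize one DCT coefficient: divide by the step size, round to nearest.
/// (Illustrative helper, not part of dct-io.)
fn quantize(coeff: f64, step: u16) -> i16 {
    (coeff / f64::from(step)).round() as i16
}

/// Dequantize: multiply back by the step size.
/// Whatever the rounding threw away stays lost.
fn dequantize(q: i16, step: u16) -> i32 {
    i32::from(q) * i32::from(step)
}
```

With a step of 16, a coefficient of -415.38 is stored as -26 and comes back as -416; anything smaller than half a step (such as 7.0) is stored as 0.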

This crate peels back those layers. It reads the compressed data, unpacks the Huffman codes, and hands you the raw coefficient numbers. You can change them and write a new, valid JPEG with your modifications baked in at the compression level. With small changes, such as flipping a least significant bit, the result looks the same in any image viewer.

What can you do with this?

Steganography — hide data inside a JPEG by tweaking the least significant bit of coefficients (the JSteg technique). The image looks the same; the bits are yours.

coefficient = 42  →  binary: 0 1 0 1 0 1 [0]
                                          └── you own this bit

flip to 1:  42 → 43   (invisible to viewers)
flip to 0:  43 → 42   (reversible — read it back later)

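Rule: only use coefficients where |value| ≥ 2, so flipping the LSB
never turns a value into zero or pushes it through zero (that would
change the zero runs in the Huffman encoding, and a reader could no
longer tell which positions carry data). For negative values, flip
the LSB of the magnitude: in two's complement, -2 with its lowest bit
set is -1, which is no longer eligible.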
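
The rule above can be sketched as a pair of embed/extract helpers. This is an illustration on plain `[i16; 64]` blocks, not dct-io's own API: all function names here are hypothetical, and the sketch stores each bit in the LSB of the magnitude so that an eligible coefficient stays eligible after writing.

```rust
/// True for coefficients that may carry a bit: |v| >= 2.
fn eligible(v: i16) -> bool {
    v.unsigned_abs() >= 2
}

/// Set the LSB of |v| to `bit`, keeping the sign.
/// Assumes v is not i16::MIN, which is outside the range JPEG coefficients use.
fn set_magnitude_lsb(v: i16, bit: bool) -> i16 {
    let mag = (v.abs() & !1) | i16::from(bit);
    mag * v.signum()
}

/// Write `bits` into the eligible AC coefficients, in block order.
/// Returns how many bits were actually written.
fn embed(blocks: &mut [[i16; 64]], bits: &[bool]) -> usize {
    let mut bits = bits.iter();
    let mut written = 0;
    for block in blocks.iter_mut() {
        // Index 0 is DC and is never touched.
        for c in block[1..].iter_mut().filter(|c| eligible(**c)) {
            match bits.next() {
                Some(&b) => {
                    *c = set_magnitude_lsb(*c, b);
                    written += 1;
                }
                None => return written,
            }
        }
    }
    written
}

/// Read back the first `n` embedded bits from the same positions.
fn extract(blocks: &[[i16; 64]], n: usize) -> Vec<bool> {
    blocks
        .iter()
        .flat_map(|b| b[1..].iter())
        .filter(|c| eligible(**c))
        .take(n)
        .map(|c| (c.unsigned_abs() & 1) == 1)
        .collect()
}
```

Because a magnitude of 2 or more never drops below 2 when only its lowest bit changes, `extract` visits exactly the positions `embed` wrote to.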

Watermarking — embed an invisible signature that survives re-saving.
Forensic analysis — inspect the raw coefficient structure to detect tampering or double compression.
Research / signal processing — work directly in the frequency domain without decoding to pixels first.

What this crate does NOT do

Decode pixel values (no inverse-DCT, no dequantisation — pixels stay out of it)
Support progressive JPEG, lossless JPEG, or arithmetic coding (returns an error)
Support JPEG 2000

Supported JPEG variants

Baseline DCT (SOF0) — the most common JPEG you'll encounter
Extended sequential DCT (SOF1)
Grayscale (1 channel) and colour (3 channels, typically Y/Cb/Cr)
All standard chroma subsampling ratios (4:4:4, 4:2:2, 4:2:0, etc.)
EXIF and JFIF headers
Restart markers (DRI / RST0–RST7)
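
If you want to know which variant a file uses before handing it to the crate, you can look for its start-of-frame (SOF) marker yourself. The sketch below is not part of dct-io; `sof_marker` and `is_sequential_huffman` are hypothetical helpers that walk the header segments using only the marker layout defined by the JPEG standard.

```rust
/// Return the first start-of-frame marker byte (0xC0 = baseline,
/// 0xC1 = extended sequential, 0xC2 = progressive, ...), or None if the
/// header is malformed or no SOF appears before the scan data.
fn sof_marker(jpeg: &[u8]) -> Option<u8> {
    // Every JPEG starts with SOI (FF D8).
    if jpeg.len() < 2 || jpeg[0] != 0xFF || jpeg[1] != 0xD8 {
        return None;
    }
    let mut pos = 2;
    while pos + 4 <= jpeg.len() {
        if jpeg[pos] != 0xFF {
            return None; // not positioned on a marker
        }
        let marker = jpeg[pos + 1];
        match marker {
            // Fill byte: markers may be preceded by extra FF bytes.
            0xFF => {
                pos += 1;
                continue;
            }
            // Standalone markers (TEM, RST0-RST7) carry no length field.
            0x01 | 0xD0..=0xD7 => {
                pos += 2;
                continue;
            }
            // C0-CF are SOF markers, except DHT (C4), JPG (C8) and DAC (CC).
            0xC0..=0xCF if !matches!(marker, 0xC4 | 0xC8 | 0xCC) => return Some(marker),
            // EOI or SOS before any SOF: give up.
            0xD9 | 0xDA => return None,
            _ => {}
        }
        // Segment length is big-endian and counts its own two bytes.
        let len = usize::from(u16::from_be_bytes([jpeg[pos + 2], jpeg[pos + 3]]));
        if len < 2 {
            return None;
        }
        pos += 2 + len;
    }
    None
}

/// SOF0 and SOF1 are the Huffman-coded sequential variants listed above.
fn is_sequential_huffman(marker: u8) -> bool {
    matches!(marker, 0xC0 | 0xC1)
}
```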

Installation

[dependencies]
dct-io = "0.1"

Examples

Read and modify coefficients

use dct_io::{read_coefficients, write_coefficients};

let jpeg = std::fs::read("photo.jpg")?;
let mut coeffs = read_coefficients(&jpeg)?;

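// Flip the LSB of every AC coefficient with |v| >= 2 in the Y (luminance) channel.
// These are "eligible" positions: a flip never creates or removes a zero, so the
// zero runs are unchanged and the output is a valid JPEG that looks identical.
// Note: `^= 1` acts on the two's-complement bits, so -2 becomes -1 (no longer
// eligible). To keep the eligible set stable, flip the LSB of the magnitude instead.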
for block in &mut coeffs.components[0].blocks {
    for coeff in block[1..].iter_mut() {   // index 0 is DC; 1–63 are AC
        if coeff.abs() >= 2 {
            *coeff ^= 1;
        }
    }
}

let modified = write_coefficients(&jpeg, &coeffs)?;
std::fs::write("photo_modified.jpg", modified)?;

Inspect image metadata cheaply

use dct_io::inspect;

let jpeg = std::fs::read("photo.jpg")?;
let info = inspect(&jpeg)?;    // does NOT decode the entropy stream
println!("{}×{}, {} components", info.width, info.height, info.components.len());
for comp in &info.components {
    println!("  id={} h={} v={} blocks={}", comp.id, comp.h_samp, comp.v_samp, comp.block_count);
}

Count how many bits you can hide

use dct_io::{read_coefficients, eligible_ac_count};

let jpeg = std::fs::read("photo.jpg")?;

// Quick path — counts without allocating the full coefficient array:
let n = eligible_ac_count(&jpeg)?;
println!("{n} positions available for LSB embedding ({} bytes)", n / 8);

// Or after you already have coefficients:
let coeffs = read_coefficients(&jpeg)?;
println!("{} positions available", coeffs.eligible_ac_count());

Query block counts

use dct_io::block_count;

let jpeg = std::fs::read("photo.jpg")?;
let counts = block_count(&jpeg)?;
for (i, &n) in counts.iter().enumerate() {
    println!("component {i}: {n} 8×8 blocks");
}
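
As a rough cross-check of those numbers, the block count of each component can be estimated from the image size and the sampling factors. The sketch below assumes the counts are padded up to whole MCUs, as in an interleaved scan; this README does not say whether `block_count` pads the same way, and `blocks_per_component` is a hypothetical helper.

```rust
/// Estimate the number of 8×8 blocks per component in an interleaved scan.
/// `samp` holds each component's (horizontal, vertical) sampling factors,
/// e.g. [(2, 2), (1, 1), (1, 1)] for 4:2:0. Factors are assumed to be >= 1.
fn blocks_per_component(width: usize, height: usize, samp: &[(usize, usize)]) -> Vec<usize> {
    let h_max = samp.iter().map(|s| s.0).max().unwrap_or(1);
    let v_max = samp.iter().map(|s| s.1).max().unwrap_or(1);
    // One MCU covers (8 * h_max) × (8 * v_max) pixels; partial MCUs at the
    // right and bottom edges are padded to full size, hence the rounding up.
    let mcu_w = 8 * h_max;
    let mcu_h = 8 * v_max;
    let mcus_x = (width + mcu_w - 1) / mcu_w;
    let mcus_y = (height + mcu_h - 1) / mcu_h;
    // Each MCU holds h × v blocks of a component.
    samp.iter().map(|&(h, v)| mcus_x * mcus_y * h * v).collect()
}
```

For a 480×320 photo with 4:2:0 subsampling this gives 30 × 20 = 600 MCUs, so 2400 luminance blocks and 600 blocks for each chroma component.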

Coefficient layout

Each block is an [i16; 64] array in JPEG zigzag scan order:

Index 0 — the DC coefficient (represents the average brightness/colour of the tile)
Indices 1–63 — AC coefficients in zigzag order (higher index = higher frequency detail)

8×8 block — coefficient indices in zigzag order

  ◄── low frequency                  high frequency ──►
  ┌────┬────┬────┬────┬────┬────┬────┬────┐
  │  0 │  1 │  5 │  6 │ 14 │ 15 │ 27 │ 28 │  ▲ low
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │ freq
  │  2 │  4 │  7 │ 13 │ 16 │ 26 │ 29 │ 42 │  │
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │
  │  3 │  8 │ 12 │ 17 │ 25 │ 30 │ 41 │ 43 │  │
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │
  │  9 │ 11 │ 18 │ 24 │ 31 │ 40 │ 44 │ 53 │  ▼
  ├────┼────┼────┼────┼────┼────┼────┼────┤
  │ 10 │ 19 │ 23 │ 32 │ 39 │ 45 │ 52 │ 54 │  ▲
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │
  │ 20 │ 22 │ 33 │ 38 │ 46 │ 51 │ 55 │ 60 │  │ high
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │ freq
  │ 21 │ 34 │ 37 │ 47 │ 50 │ 56 │ 59 │ 61 │  │
  ├────┼────┼────┼────┼────┼────┼────┼────┤  │
  │ 35 │ 36 │ 48 │ 49 │ 57 │ 58 │ 62 │ 63 │  ▼
  └────┴────┴────┴────┴────┴────┴────┴────┘
    ↑ block[0] = DC (average brightness of this tile)
    block[1..=63] = AC (the detail, from coarse to fine)
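
If you need to map a zigzag index back to its row and column in the tile, the table in the diagram can be generated rather than typed out. A minimal sketch follows; `zigzag_positions` is an illustrative name, not a dct-io function.

```rust
/// Build the table mapping zigzag index -> (row, col) within an 8×8 block.
fn zigzag_positions() -> [(usize, usize); 64] {
    let mut table = [(0, 0); 64];
    let mut i = 0;
    // Walk the 15 anti-diagonals (row + col == d), alternating direction.
    for d in 0..15usize {
        let lo = d.saturating_sub(7); // smallest row that exists on this diagonal
        let hi = d.min(7); // largest row that exists on this diagonal
        for k in lo..=hi {
            // Even diagonals run from bottom-left to top-right (row decreasing),
            // odd diagonals from top-right to bottom-left (row increasing).
            let row = if d % 2 == 0 { hi + lo - k } else { k };
            table[i] = (row, d - row);
            i += 1;
        }
    }
    table
}
```

For example, `zigzag_positions()[5]` is `(0, 2)`, matching the 5 in the third column of the top row of the diagram.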

The values are the quantized coefficients exactly as stored in the file. They have not been dequantized; multiply each one by the matching quantization table entry if you want an approximation of the pre-quantization DCT values (the rounding loss cannot be undone).

The "eligible" positions for safe LSB embedding are AC coefficients with |v| >= 2. Flipping the LSB of the magnitude of such a coefficient never changes whether it is zero or non-zero, nor how many bits its magnitude needs, so the Huffman code lengths (and therefore the re-encoded stream structure) stay the same.
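
Why the code lengths hold still: in Huffman-coded JPEG, each non-zero AC coefficient is written as a (zero run, size) symbol followed by `size` raw bits, where `size` is the number of bits needed for the magnitude. The sketch below (hypothetical helper names, not dct-io API) checks that flipping the lowest magnitude bit of an eligible value never changes that size.

```rust
/// JPEG "size" category: how many bits are needed to write |v| (0 for v == 0).
fn size_category(v: i16) -> u32 {
    16 - v.unsigned_abs().leading_zeros()
}

/// Flip the lowest bit of the magnitude, keeping the sign.
/// Assumes v is not i16::MIN, which is outside the range JPEG coefficients use.
fn flip_magnitude_lsb(v: i16) -> i16 {
    (v.abs() ^ 1) * v.signum()
}

/// Check that every eligible value in `lo..=hi` keeps its size category
/// after a magnitude-LSB flip.
fn category_is_stable(lo: i16, hi: i16) -> bool {
    (lo..=hi)
        .filter(|v| v.unsigned_abs() >= 2)
        .all(|v| size_category(flip_magnitude_lsb(v)) == size_category(v))
}
```

Note that a raw two's-complement XOR behaves differently for some negative values: `-2 ^ 1` is `-1`, which drops from size 2 to size 1.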

Safety and security

#![forbid(unsafe_code)] — zero unsafe Rust in this crate, guaranteed at compile time
No panics on crafted input — every error path returns a DctError; the parser validates dimensions, sampling factors, Huffman table structure, component indices, and MCU counts before touching any data
Allocation cap — MCU count is capped at 1 million (~67 megapixels) to prevent memory exhaustion from a malicious input
Huffman overflow guard — canonical code overflow in malformed DHT segments is caught before it can write out-of-bounds into the LUT
Fuzz targets included (see fuzz/) — run with cargo fuzz run fuzz_read

Error handling

pub enum DctError {
    NotJpeg,              // doesn't start with a JPEG SOI marker
    Truncated,            // file ends before parsing is complete
    CorruptEntropy,       // invalid or malformed Huffman data
    Unsupported(String),  // progressive, lossless, arithmetic coding, etc.
    Missing(String),      // a required marker or table is absent
    Incompatible(String), // coefficient data doesn't match this JPEG's structure
}
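
One way to use these variants is to turn them into messages for end users. In the sketch below the enum is copied locally so the snippet compiles on its own (in real code you would `use dct_io::DctError` instead), and `describe` is a hypothetical helper, not something the crate provides.

```rust
// Local mirror of the enum above, only so this snippet is self-contained.
#[derive(Debug)]
pub enum DctError {
    NotJpeg,
    Truncated,
    CorruptEntropy,
    Unsupported(String),
    Missing(String),
    Incompatible(String),
}

/// Hypothetical helper: map each error to a user-facing message.
fn describe(err: &DctError) -> String {
    match err {
        DctError::NotJpeg => "this file is not a JPEG".to_string(),
        DctError::Truncated | DctError::CorruptEntropy => {
            "the JPEG is damaged or incomplete".to_string()
        }
        DctError::Unsupported(what) => format!("unsupported JPEG feature: {what}"),
        DctError::Missing(what) => format!("the JPEG lacks a required part: {what}"),
        DctError::Incompatible(why) => format!("coefficients do not fit this JPEG: {why}"),
    }
}
```

Matching exhaustively (without a `_` arm) means the compiler will point out any variant you have not handled.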

All public functions are marked #[must_use] — the compiler will warn if you forget to handle the returned Result.

Limitations

Roundtrip identity

write_coefficients(&jpeg, &read_coefficients(&jpeg)?) produces byte-identical output for JPEGs encoded by libjpeg, libjpeg-turbo, and most standard encoders. Exotic encoders that use non-standard Huffman table construction or non-standard EOB placement may decode identically but not round-trip byte-for-byte.

Multiple scans

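Only the first SOS (start-of-scan) segment is processed. Baseline JPEGs written by common encoders contain a single interleaved scan, although the standard also allows sequential files to store each component in its own scan; progressive JPEG is not supported.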

Fuzzing

cargo install cargo-fuzz
cargo fuzz run fuzz_read        # throw random bytes at the parser
cargo fuzz run fuzz_roundtrip   # verify read→write→read consistency

Licence

Licensed under either of:

Apache License, Version 2.0 (LICENSE-APACHE)
MIT license (LICENSE-MIT)

at your option.
