Multi-14: Sudoku Logic Data Generator

Generates synthetic datasets for stepwise Sudoku solving. The agent must fill in a partially completed 9×9 Sudoku grid one cell at a time, respecting row, column, and 3×3-region uniqueness constraints — a canonical constraint-satisfaction problem with deeply nested logical dependencies.

Each sample pairs a task (first frame + prompt describing what needs to happen) with its ground truth solution (final frame showing the result + video demonstrating how to achieve it). This structure enables both model evaluation and training.

📌 Basic Information

Property	Value
Task ID	Multi-14
Task	Sudoku Stepwise Solving
Category	Constraint Satisfaction Puzzles
Resolution	1024×1024 px
FPS	16 fps
Duration	varies
Output	PNG images + MP4 video

🚀 Usage

Installation

# 1. Clone the repository
git clone https://github.com/VBVR-DataFactory/Multi-14_sudoku_logic_data-generator.git
cd Multi-14_sudoku_logic_data-generator

# 2. Create and activate virtual environment
python3 -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

# 3. Install dependencies
pip install --upgrade pip
pip install -r requirements.txt
pip install -e .

Generate Data

# Generate 50 samples
python examples/generate.py --num-samples 50

# Reproducible generation with seed
python examples/generate.py --num-samples 50 --seed 42

# Custom output directory
python examples/generate.py --num-samples 100 --output data/my_dataset

# Without videos (faster, images only)
python examples/generate.py --num-samples 50 --no-videos

Command-Line Options

Argument	Description
`--num-samples`	Number of tasks to generate (required)
`--output`	Output directory (default: data/questions)
`--seed`	Random seed for reproducibility
`--no-videos`	Skip video generation (images only)

📖 Task Example

Prompt

[Scenario] The image displays a 9x9 Sudoku puzzle grid with some pre-filled numeric clues.
[Rules]
1. Each row must contain the digits 1-9 exactly once.
2. Each column must contain the digits 1-9 exactly once.
3. Each of the nine 3x3 sub-grids must contain the digits 1-9 exactly once.
[Task] Generate a video animating the logical completion of the Sudoku grid. Fill all empty cells with the correct digits step-by-step, strictly following a top-to-bottom, left-to-right order, until the unique solution is fully revealed.

Visual


Initial Frame Sudoku puzzle with given clues	Animation Cells filled in valid solving order	Final Frame Completed 9×9 grid satisfying all constraints

📖 Task Description

Objective

Complete a 9×9 Sudoku puzzle so that every row, column, and 3×3 region contains each digit 1–9 exactly once, while preserving the fixed starting clues.

Task Setup

Grid: Standard 9×9 Sudoku, partitioned into nine 3×3 regions.
Givens: A subset of clue cells fixed at generation time.
Constraints: Each row, column, and 3×3 region must contain {1, 2, …, 9} with no repeats.
Uniqueness: Puzzles are generated with a unique solution (verified via solver backtracking).
Animation: Cells are revealed in a logically valid order — each new digit visibly satisfies the constraint set at the moment of placement.

Key Features

Global constraint propagation: Each digit placement narrows the candidate set of every related row/column/region.
Long-horizon dependency chains: Late-stage cells often depend on inferences from cells filled dozens of steps earlier.
Deterministic ground truth: The puzzle has a unique solution; the solving sequence is reproducible.
Visible incremental commitment: Frame-by-frame reveal lets evaluators verify intermediate state correctness, not just final outcome.

📦 Data Format

data/questions/Multi-14_sudoku_logic_data-generator_task/Multi-14_sudoku_logic_data-generator_00000000/
├── first_frame.png            # Sudoku puzzle with given clues
├── final_frame.png            # Fully completed grid
├── prompt.txt                 # Task instruction
├── ground_truth.mp4           # Animation of stepwise filling
└── question_metadata.json     # Standardized VBVR task metadata

File specifications:

Images: 1024×1024 PNG format
Video: MP4 format, 16 fps, H.264 + yuv420p
Metadata: VBVR canonical schema with task_id, vbvr_task_code, media, parameters

🏷️ Tags

sudoku constraint-satisfaction csp logic-puzzle stepwise-solving multi-step-reasoning long-horizon

Part of the 36-Task Long-Horizon Multi-Step Reasoning Benchmark.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
core		core
examples		examples
samples		samples
src		src
.gitignore		.gitignore
.gitmodules		.gitmodules
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Multi-14: Sudoku Logic Data Generator

📌 Basic Information

🚀 Usage

Installation

Generate Data

Command-Line Options

📖 Task Example

Prompt

Visual

📖 Task Description

Objective

Task Setup

Key Features

📦 Data Format

🏷️ Tags

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Multi-14: Sudoku Logic Data Generator

📌 Basic Information

🚀 Usage

Installation

Generate Data

Command-Line Options

📖 Task Example

Prompt

Visual

📖 Task Description

Objective

Task Setup

Key Features

📦 Data Format

🏷️ Tags

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages