LearnfunPlayfun for Atari 2600

An AI that learns to play Atari 2600 games by watching human gameplay, then plays autonomously using brute-force search over discovered objective functions.

Ported from LearnfunPlayfun-Revival (NES/FCEUX) to Atari 2600 via the stella2023 libretro core.

Based on Tom Murphy VII's original learnfun & playfun — the paper is a great read if you want to understand the theory.

How it works

Record a human playing an Atari game (recordfun)
Learn objective functions from the recording by finding lexicographic orderings on RAM bytes that correlate with "progress" (learnfun)
Play the game autonomously by searching over possible input sequences and scoring them against the learned objectives (playfun)
Replay the AI's best run visually with sound (replayfun)

The approach is entirely game-agnostic — it knows nothing about any specific game. It discovers what "progress" means by observing which RAM values tend to increase when a human plays well.

Tools

Executable	Purpose
`recordfun`	Play the game with keyboard and save inputs as `.a26inp`
`learnfun`	Analyze a recording to discover objective functions
`playfun`	AI plays the game using learned objectives (master/helper distributed search)
`replayfun`	Watch a `.a26inp` replay with video and audio
`scopefun`	Visualize RAM activity during a replay
`pinviz`	Visualize input pin activity
`tasbot_gui.pyw`	Python/tkinter GUI for the full workflow

Building

Prerequisites

Visual Studio 2022 (MSVC v143 toolset)
CMake 3.21+
vcpkg (with VCPKG_ROOT environment variable set)
SDL2 (via vcpkg: vcpkg install sdl2:x64-windows)
Other deps installed automatically via vcpkg: protobuf, zlib, libpng, lz4

Clone with stella2023

git clone https://github.com/AtlasRedux/LearnfunPlayfunAtari2600.git
cd LearnfunPlayfunAtari2600
git submodule update --init --recursive

Build

cd tasbot
cmake --preset default
cmake --build --preset release

Or use the included build.bat from the project root.

Binaries are output to tasbot/build/Release/.

Usage

The fastest way is the included GUI:

cd tasbot\build\Release
pythonw tasbot_gui.pyw

Drop a .a26 (or .bin) file into that folder (or use the Browse… button), then click through the four buttons in order:

🎮 Record (play game) — opens an SDL window; play the game with the keyboard, ESC saves the input log as <game>.a26inp.
▶ Pretrain (learnfun) — analyses the recording and writes <game>.objectives.
▶ Run (playfun) — spawns one helper process per CPU core (configurable) plus a master. The AI plays the game and saves its best run so far to <game>-playfun-futures-progress.a26inp periodically. The Resume checkbox continues from an existing progress file.
🎬 Watch Replay — opens replayfun.exe on the latest progress file, with audio and per-frame controls.

The GUI auto-detects ROMs and input logs in its directory, manages the helper/master lifecycle, and streams the master's coloured progress bars to its log pane.

Keyboard controls in recordfun:

Arrow keys: joystick directions
Z / Space: fire button
Enter: console reset
Tab: console select
ESC: quit and save

Manual command-line workflow

If you'd rather skip the GUI, all tools read a config.txt file in the working directory:

game pitfall
rom pitfall.a26
movie pitfall.a26inp

recordfun.exe       # Step 1 — record human play, writes <game>.a26inp
learnfun.exe        # Step 2 — write <game>.objectives

# Step 3 — start one helper per core, then the master:
playfun.exe --helper 29000
playfun.exe --helper 29001
playfun.exe --master 29000 29001 ...

replayfun.exe       # Step 4 — watch the latest progress file

replayfun.exe controls: Space (pause), Right (step), Up/Down (speed), R (restart), M (mute), ESC (quit).

Architecture

Emulator wrapper (emulator.cc): Abstracts the stella2023 libretro core behind a clean C++ API — initialize, step, save/load state, read RAM, get video/audio.
Objective discovery (objective.cc, weighted-objectives.cc): Finds lexicographic orderings on RAM that correlate with human gameplay progress. Works on arbitrary byte arrays — fully generic.
MARIONET protocol (marionet.proto, netutil.cc): Protobuf-based TCP protocol for distributed master/helper search. Sends opaque byte blobs (emulator states, input sequences).
Input format (.a26inp): Simple text format — one line per frame, 5 characters UDLRF where . means not pressed. Human-readable and diffable.

Atari 2600 vs NES

	NES (original)	Atari 2600 (this port)
Emulator	FCEUX (embedded)	stella2023 (libretro)
RAM	2048 bytes	128 bytes
Inputs	8 buttons (RLDUTSBA)	5 inputs (UDLRF) + console switches
State size	~10-20 KB	~1.2 KB
Input format	FM2	.a26inp
Recording	FCEUX built-in	recordfun (SDL2)

The smaller RAM and state sizes make the Atari version significantly faster per-frame than the NES version.

Credits

Tom Murphy VII — original learnfun/playfun concept, NES implementation, and the brilliant paper
AtlasRedux — NES revival and Atari 2600 port
stella2023 — Atari 2600 libretro core by the Stella team
cc-lib — Tom Murphy VII's utility library

License

Same terms as the original learnfun/playfun. See individual source files for details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

LearnfunPlayfun for Atari 2600

How it works

Tools

Building

Prerequisites

Clone with stella2023

Build

Usage

Manual command-line workflow

Architecture

Atari 2600 vs NES

Credits

License

FilesExpand file tree

README.md

Latest commit

History

README.md

File metadata and controls

LearnfunPlayfun for Atari 2600

How it works

Tools

Building

Prerequisites

Clone with stella2023

Build

Usage

Manual command-line workflow

Architecture

Atari 2600 vs NES

Credits

License