Q-Learning Tic-Tac-Toe Bot

This project implements Q-learning reinforcement learning to play Tic-Tac-Toe against itself to learn optimal strategies.

Features

Q-Learning table mapping states & actions with rewards
Self-play training of 100k+ rounds
Evaluation & training functions
50% reduced training time due to canonicalization of game states

How It Works

Board is represented as a tuple
Each unique state is an entry in the Q-table
Uses epsilon-greedy exploration, with epsilon decaying throughout training

Installation & Training

# Clone the repository
git clone https://github.com/DanielJoy6/TicTacToe-Bot.git
cd TicTacToe-Bot

# Install dependencies
pip install numpy

# Train the agent
python train.py

Future Improvements:

Reducing number of boards in Q-table by mapping rotations into single state
Saving & Loading q-table
MatPlotLib plots of winrate over time throughout training

For any questions, please email me at danieljoy2345@gmail.com

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
__pycache__		__pycache__
DQNMemory.pkl		DQNMemory.pkl
QLTableO		QLTableO
QLTableX		QLTableX
README.md		README.md
TestingPlayers.py		TestingPlayers.py
base_player.py		base_player.py
dqn_player.py		dqn_player.py
game.py		game.py
minmax_alphabeta_player.py		minmax_alphabeta_player.py
minmax_player.py		minmax_player.py
monteCarlo_player.py		monteCarlo_player.py
player.py		player.py
q_table.npy		q_table.npy
ql_player.py		ql_player.py
random_player.py		random_player.py
tournament.py		tournament.py
tournamentGlicko.py		tournamentGlicko.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Q-Learning Tic-Tac-Toe Bot

Features

How It Works

Installation & Training

About

Uh oh!

Releases

Packages

Languages

DanielJoy6/TicTacToe-Bot

Folders and files

Latest commit

History

Repository files navigation

Q-Learning Tic-Tac-Toe Bot

Features

How It Works

Installation & Training

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages