An RL agent (using Q-learning) that learns to play Numerical Tic-Tac-Toe with odd numbers. The environment is playing randomly with the agent, i.e. its strategy is to put an even number randomly in an empty cell.
quirkovate/TicTacToe_Agent
Folders and files
| Name | Name | Last commit date | ||
|---|---|---|---|---|