Rhinoforcement

Abstract

The AlphaZero project by Google DeepMind aroused my interest both as a developer and as a chess player. In this project we attempt to take the main ideas behind AlphaZero and adapt them to our needs and to our limited computing power. For those unfamiliar with it, AlphaZero combines Monte Carlo Tree Search with deep learning to learn the game of chess from self-play, with unprecedented results. We apply this method to Connect 4 for faster feedback and learning, while keeping the program architecture flexible enough to allow a later change of game.

Our objectives are:

Creating a program capable of learning any classical game.
Exploring different neural net architectures.
Exploring dataset management (duplicate deletion, depth compensation, etc.).
Understanding hyperparameters and their impact.
Quantifying our results to make informed choices.
Enabling others to understand these choices and adapt them to their own projects.

Monte Carlo Tree Search

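MCTS description (to be completed): the search grows a tree of game states by repeating four steps, namely selection, expansion, simulation and backpropagation.
To run it, the game implementation needs to:

return a state
return the set of all possible moves from a state
identify terminal states

Nothing else is required.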
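
To make these requirements concrete, here is a minimal sketch of MCTS with UCB1 selection and random playouts. It is an illustration under assumptions, not the code in MCTS.py, node.py or state.py: the `Nim` toy game, the `Node` class, the method names (`legal_moves`, `play`, `is_terminal`, `to_move`, `winner`) and the exploration constant `c=1.4` are all hypothetical choices made for the example.

```python
import math
import random


class Nim:
    """Hypothetical toy game: take 1 to 3 stones, whoever takes the last one wins."""

    def initial_state(self):
        return (5, 1)  # (stones left, player to move: +1 or -1)

    def legal_moves(self, state):
        return [n for n in (1, 2, 3) if n <= state[0]]

    def play(self, state, move):
        return (state[0] - move, -state[1])

    def is_terminal(self, state):
        return state[0] == 0

    def to_move(self, state):
        return state[1]

    def winner(self, state):
        # In a terminal state, the player who just moved took the last stone.
        return -state[1]


class Node:
    def __init__(self, state, parent=None):
        self.state = state
        self.parent = parent
        self.children = {}  # move -> Node
        self.visits = 0
        self.wins = 0.0     # wins for the player who moved into this node


def ucb1(child, parent_visits, c=1.4):
    exploit = child.wins / child.visits
    explore = c * math.sqrt(math.log(parent_visits) / child.visits)
    return exploit + explore


def mcts(game, root_state, iterations=2000, seed=0):
    rng = random.Random(seed)
    root = Node(root_state)
    for _ in range(iterations):
        node = root
        # 1. Selection: follow UCB1 while every legal move already has a child.
        while node.children and len(node.children) == len(game.legal_moves(node.state)):
            parent = node
            node = max(parent.children.values(), key=lambda ch: ucb1(ch, parent.visits))
        # 2. Expansion: add one untried move (nothing to add on terminal states).
        untried = [m for m in game.legal_moves(node.state) if m not in node.children]
        if untried:
            move = rng.choice(untried)
            child = Node(game.play(node.state, move), parent=node)
            node.children[move] = child
            node = child
        # 3. Simulation: play random moves until the game ends.
        state = node.state
        while not game.is_terminal(state):
            state = game.play(state, rng.choice(game.legal_moves(state)))
        winner = game.winner(state)
        # 4. Backpropagation: credit each node whose incoming move was the winner's.
        while node is not None:
            node.visits += 1
            if node.parent is not None and game.to_move(node.parent.state) == winner:
                node.wins += 1
            node = node.parent
    # The most visited move at the root is the recommendation.
    return max(root.children, key=lambda m: root.children[m].visits)
```

Calling `mcts(Nim(), Nim().initial_state())` returns the recommended move from the starting position. Because the search only talks to the game through those few methods, swapping the toy game for Connect 4 should not require touching the search itself.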

Inserting a Neural Net

Architecture.
Switching from UCB1 to PUCT.
Training pipeline.
VS mode or continuous learning?
Adapting our dataset to remove bias for early performance?
Improving speed: dictionaries vs tree nodes, and writing a C library if needed.
Building an infrastructure to record and display the performance impact of different choices.
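
As an illustration of the switch from UCB1 to PUCT, the sketch below scores moves with the PUCT rule used by AlphaZero, Q(s, a) + c_puct * P(s, a) * sqrt(sum of N(s, b)) / (1 + N(s, a)), where the prior P comes from the neural net's policy output. The function names, the dictionary layout and the value `c_puct=1.5` are assumptions made for the example, not taken from this repository.

```python
import math


def puct_score(q, prior, parent_visits, child_visits, c_puct=1.5):
    """Mean value plus an exploration bonus weighted by the network prior."""
    return q + c_puct * prior * math.sqrt(parent_visits) / (1 + child_visits)


def select_move(stats, priors, c_puct=1.5):
    """Pick the move with the highest PUCT score.

    stats:  move -> (visit_count, total_value) for the current state
    priors: move -> prior probability from the policy head
    """
    parent_visits = sum(n for n, _ in stats.values())

    def score(move):
        n, w = stats[move]
        q = w / n if n else 0.0  # unvisited moves start with a neutral value
        return puct_score(q, priors[move], parent_visits, n, c_puct)

    return max(stats, key=score)
```

Unlike UCB1, an unvisited move does not get an infinite score here: its bonus is proportional to its prior, so the network steers which moves are tried first. Keeping the statistics in plain dictionaries keyed by move, as in this sketch, is one possible form of the dictionaries-versus-tree-nodes option listed above.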


Repository: ezalos/Rhinoforcement
