Temporal Difference Learning Project

TD Learning Project

We are three students in the Academic college of Tel-Aviv-Yaffo. For our final project under the supervision of Dr. Gideon Dror, we decided to make a self learning system for strategic games using Artificial Neural Networks.
Our main reference was Tesauro's article about TD (Temporal Difference) learning and TD-Gammon.
We looked for an easy strategic game and after a while we found the BonesGames site. They have a lot of nice games (for free), and we decided to base our game on their LNL game.
In order to check our td learning system we first used a kind of random walk game, then a tic tac toe game and only then we started dealing with the real game.
Random Walk
The board of the game are six squares layered in two rows.
123
456
The player starts the game at square # 1. If the player reaches square # 5 he looses, if he comes to square # 6 he wins.
We use the raw data as an input for the system.
So this is a very simple game and our system learnt it fine.
Tic Tac Toe
The famous game. (I guess you all know it).
As input we used six parameters:
1. The number of rows and columns that have only one token of the player and no token of the opponent.
2. The same as 1 but with two tokens.
3. The same as 1 but with three tokens.
4,5,6 the same as 1,2,3 but for the opponent.
Well the results were good (much better than raw data input).
LNL - War in space
the real thing. You can see for yourself the results

Temporal Difference Learning and TD-Gammon by Gerald Tesauro

Download the source files

Applets for Neural Networks and Artificial Life

ANN resources on the Internet

Bibliographies on Neural Networks

Chen Levkovich

chen_levkovich@yahoo.com

Tali Hildeshaim

talihi@cs.bgu.ac.il

orly_lev@creo.com

Orly Levkovich-Frank

Reinforcement Learning: Introduction Richard S. Sutton & Andrew G. Barto

Introduction to Artificial Neural Networks course