A geometric explosion of data

I’m already facing up to the fact that the sheer amount of data in a game of Gobang / Gomoku / Connect 5 is quite a bit larger than in Connect 4; after all, the board is 15×15, where a Connect4 board is 7×6. 225 spaces on the board instead of 42 means you have a correspondingly greater number of inputs for the neural network to process.

Additionally, the games last longer. The average game of Connect 4 I saw lasted 10-20 moves (with most closer to 10). In the first tests I’ve done for Gomoku, the games last between 60-100 moves.

Finally, there’s the issue of convolution. Consider the two positions below:

Shift

Most people will see these two positions as being functionally equivalent. To traditional computer evaluation, however, they are completely distinct. Ideally, a neural network would generalize what it learns such that either one is equivalent. To facilitate this, I’ll be taking board positions like the above and shifting them over the available space on the board; I’ll also be rotating and flipping them.

With all of the above, I did some tests and found that each game would generate around 6,000 pieces of training data, as opposed to about 12 for a game of Connect 4.

Since it won’t be practical to run as many games per training set (the time to train would quickly prove ridiculous), I’m going to prime each training set with a series of examples standard positions meant to train the neural network in the basics of play. Hopefully, this will result in better performance overall.

A geometric explosion of data

Published by BenjaminDChambers

Leave a comment Cancel reply

Share this:

Published by BenjaminDChambers

Leave a comment Cancel reply