Well, this is out of my league in fact.
Tryss wrote:
Step 1 : Start by training a level x policy network on games of players with level x. This network is reusable.
Step 2 : Add another couple layers at the end and train the last couple layers to discriminate between player A and not player A (you can further train the whole network when your last layers are well trained).
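For anyone who wants to see the shape of the two steps quoted above, here is a rough toy sketch in plain Python. It is not Tryss's code and not a Go network. The "trunk" is just a fixed random layer standing in for the level x policy network from Step 1, the data is made up, and all names (`trunk`, `train_head`, and so on) are hypothetical. The only point is the Step 2 mechanic: the trunk stays frozen and only the new last layer is trained to separate "player A" from "not player A".

```python
import math
import random

random.seed(0)
N_IN, N_FEAT = 6, 8

# Stand-in for the Step 1 trunk. ASSUMPTION: in a real setup these weights
# would come from a policy network trained on level-x games; here they are
# random and never updated.
trunk = [[random.gauss(0, 1) for _ in range(N_IN)] for _ in range(N_FEAT)]


def trunk_features(x):
    """Frozen trunk: one tanh layer mapping an input to N_FEAT features."""
    return [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in trunk]


def head_output(head, feats):
    """New last layer (Step 2): a single logistic unit on top of the trunk."""
    weights, bias = head
    z = bias + sum(w * f for w, f in zip(weights, feats))
    return 1.0 / (1.0 + math.exp(-z))


def mean_loss(head, feats_list, labels):
    """Average cross-entropy of the head over the data set."""
    total = 0.0
    for feats, y in zip(feats_list, labels):
        p = min(max(head_output(head, feats), 1e-12), 1 - 1e-12)
        total -= y * math.log(p) + (1 - y) * math.log(1 - p)
    return total / len(labels)


def train_head(feats_list, labels, epochs=200, lr=0.1):
    """Full-batch gradient descent on the head only; the trunk is untouched."""
    weights, bias = [0.0] * N_FEAT, 0.0
    n = len(labels)
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * N_FEAT, 0.0
        for feats, y in zip(feats_list, labels):
            err = head_output((weights, bias), feats) - y
            grad_b += err / n
            for i, f in enumerate(feats):
                grad_w[i] += err * f / n
        weights = [w - lr * g for w, g in zip(weights, grad_w)]
        bias -= lr * grad_b
    return weights, bias


# Made-up data: label 1 means "player A", 0 means "not player A".
inputs = [[random.gauss(0, 1) for _ in range(N_IN)] for _ in range(200)]
labels = [1 if x[0] + 0.5 * x[1] > 0 else 0 for x in inputs]

# The trunk is frozen, so its features can be computed once and reused.
feats_list = [trunk_features(x) for x in inputs]

trunk_before = [row[:] for row in trunk]
untrained_head = ([0.0] * N_FEAT, 0.0)
loss_before = mean_loss(untrained_head, feats_list, labels)
trained_head = train_head(feats_list, labels)
loss_after = mean_loss(trained_head, feats_list, labels)
print(f"loss before: {loss_before:.3f}, after: {loss_after:.3f}")
```

The optional last part of Step 2 (further training the whole network) would mean also updating `trunk` once the head has settled, which this sketch leaves out.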
I gave up on the previous run: it was stuck at the same error level for the whole day. I made a few more tries with different network sizes, but it's all the same, the control group hardly gets past 39% error.
I guess that's it for me. I will upload the script I used to prepare the training data for others to try.
Just for fun: I noticed the SGF games I downloaded include the ELO rating of the bots, so I'm thinking of building a new training set with bots that are more or less at the same level (within a given ELO interval) and trying again to see if it makes a difference (I will probably download more archives from CGOS for that).
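A minimal sketch of how that ELO filter could look, in case it helps. It assumes the ratings sit in the standard SGF rank properties `BR[...]` and `WR[...]` as a number (possibly followed by something like a `?`); I have not checked that against every archive, so treat the property names and the sample game below as assumptions. The function names are made up for this example.

```python
import re

# ASSUMPTION: the bot ratings are stored in the standard SGF rank properties
# BR (black) and WR (white) and start with digits, e.g. BR[2100] or WR[2180?].
# The lookbehind avoids matching a longer property name that ends in BR/WR.
RATING_PATTERNS = {
    "black": re.compile(r"(?<![A-Z])BR\[(\d+)"),
    "white": re.compile(r"(?<![A-Z])WR\[(\d+)"),
}


def player_ratings(sgf_text):
    """Return {'black': int, 'white': int}, or None if a rating is missing."""
    ratings = {}
    for color, pattern in RATING_PATTERNS.items():
        match = pattern.search(sgf_text)
        if match is None:
            return None
        ratings[color] = int(match.group(1))
    return ratings


def in_elo_band(sgf_text, low, high):
    """True only if both players have a rating inside [low, high]."""
    ratings = player_ratings(sgf_text)
    if ratings is None:
        return False
    return all(low <= value <= high for value in ratings.values())


# Made-up game record, only for illustration.
sample_game = "(;GM[1]FF[4]SZ[9]PB[botA]BR[2100]PW[botB]WR[2180?];B[ee];W[ce])"
unrated_game = "(;GM[1]FF[4]SZ[9]PB[botA]PW[botB];B[ee])"

print(player_ratings(sample_game))
print(in_elo_band(sample_game, 2000, 2200))
```

Requiring both players to be inside the interval keeps only games between bots of similar strength; filtering on a single player's rating would be a one-line change in `in_elo_band`.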