bug-gnubg

Re: The status of gnubg?


From: Isaac Keslassy
Subject: Re: The status of gnubg?
Date: Mon, 19 Oct 2020 23:23:54 +0300
User-agent: Mozilla/5.0 (Windows NT 10.0; WOW64; rv:68.0) Gecko/20100101 Thunderbird/68.12.1

Hi,

It would be great to renew the effort on gnubg!

I have a question regarding the fundamental NN weight-improvement technique. If I understand correctly, you are taking a supervised-learning approach: pick tough positions, determine the best move with rollouts, then gradually optimize the NN weights toward those targets. As Joseph mentioned, though, this may degrade the NN's play in the positions that arise in regular games.

There are, however, other techniques that have proved more effective in games such as chess. They avoid long rollouts and work directly on positions from regular games. For instance:

1. SPSA: This is an obvious approach. Let the NN play against a very slightly perturbed copy of itself, move toward the winner's parameters, and let this random walk gradually converge to better weights.
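To make the SPSA idea concrete, here is a minimal toy sketch. Everything in it is hypothetical: `play_match` stands in for "let the two versions of the net play a short match" (its hidden optimum `W_OPT` plays the role of the unknown best weights), and the gains `a` and `c` are arbitrary constants (classic SPSA decays them over time; fixed values keep the sketch short).

```python
import math
import random

# Hidden "true" optimum; stands in for the unknown best NN weights.
W_OPT = [0.5, -1.0, 2.0]

def strength(w):
    # Toy playing strength: peaks at W_OPT. A real tuner never sees this.
    return -sum((wi - oi) ** 2 for wi, oi in zip(w, W_OPT))

def play_match(w_plus, w_minus, games=200, rng=random):
    """Return w_plus's average score in [-1, +1] over a short noisy match."""
    p_win = 1.0 / (1.0 + math.exp(strength(w_minus) - strength(w_plus)))
    wins = sum(rng.random() < p_win for _ in range(games))
    return 2.0 * wins / games - 1.0

def spsa_tune(w, iters=500, a=0.05, c=0.1, seed=1):
    rng = random.Random(seed)
    for _ in range(iters):
        # Perturb all weights at once with a random +/-1 vector.
        delta = [rng.choice((-1.0, 1.0)) for _ in w]
        w_plus = [wi + c * di for wi, di in zip(w, delta)]
        w_minus = [wi - c * di for wi, di in zip(w, delta)]
        score = play_match(w_plus, w_minus, rng=rng)
        # Move toward the winner: the match score is the gradient estimate.
        w = [wi + a * score * di for wi, di in zip(w, delta)]
    return w

w0 = [0.0, 0.0, 0.0]
w1 = spsa_tune(w0)
```

The appeal is that each step needs only one short, noisy match rather than rollouts; the random-walk noise averages out over many iterations.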

2. Logistic regression: Instead of teaching the best move, teach the position equity (as Aaron also mentioned). Specifically, we could try to minimize the equity error associated with each position. Assume DMP for simplicity. Play a million games of self-play and label every position encountered with the final game result (-1 for a loss, +1 for a win). Then tune all the NN weights through gradient descent to minimize the difference between each position's equity estimate and the final result.
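As a toy sketch of this outcome-regression idea (again, everything here is hypothetical: synthetic random "positions" instead of self-play data, a single tanh layer instead of gnubg's net, plain batch gradient descent):

```python
import math
import random

rng = random.Random(0)
N_FEATURES = 5

# Hidden evaluation used only to generate synthetic labeled data.
w_true = [rng.uniform(-1, 1) for _ in range(N_FEATURES)]

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def make_position():
    # A fake "position": feature vector x, DMP result r in {-1, +1}
    # sampled from a hidden true winning probability.
    x = [rng.uniform(-1, 1) for _ in range(N_FEATURES)]
    p_win = 1.0 / (1.0 + math.exp(-4.0 * dot(w_true, x)))
    r = 1.0 if rng.random() < p_win else -1.0
    return x, r

data = [make_position() for _ in range(5000)]

def equity(w, x):
    return math.tanh(dot(w, x))  # model's equity estimate in [-1, +1]

def mse(w):
    return sum((equity(w, x) - r) ** 2 for x, r in data) / len(data)

def train(w, epochs=30, lr=0.1):
    for _ in range(epochs):
        grad = [0.0] * len(w)
        for x, r in data:
            e = equity(w, x)
            g = 2.0 * (e - r) * (1.0 - e * e)  # chain rule through tanh
            for i, xi in enumerate(x):
                grad[i] += g * xi
        w = [wi - lr * gi / len(data) for wi, gi in zip(w, grad)]
    return w

w0 = [0.0] * N_FEATURES
w1 = train(w0)
```

Even though each individual game result is extremely noisy, minimizing the squared error against millions of outcomes pushes the evaluation toward the true expected result, which at DMP is exactly the equity.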

(For more details, see https://www.chessprogramming.org/Automated_Tuning, which covers Texel tuning, SPSA, and related methods.)

Has anybody tried such alternative methods?

Thanks,
Isaac


