[Bug-gnubg] Feature request: Rollout frequency based on std. errors

bug-gnubg

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

[Bug-gnubg] Feature request: Rollout frequency based on std. errors

From:	Christopher Yep
Subject:	[Bug-gnubg] Feature request: Rollout frequency based on std. errors
Date:	Mon, 31 Aug 2009 05:05:58 -0400

I don't know how difficult this would be to program, but one possibleenhancement (which the user can optionally select) for rolling outpositions is to rollout moves (candidate plays) with large standard errorsmore frequently than moves with small standard errors.

For example, a user may want to rollout four moves (A, B, C, and D). MovesA and B may usually lead to complex (future) positions which gnubg has ahard time evaluating, while moves C and D may usually lead to simplepositions (e.g. holding games, races, etc.) which gnubg can easilyevaluate. So, variance reduction is much less effective for moves A and Bthan for moves C and D. It may be the case that the standard errors ofmoves A and B are about 4 times as large as the standard errors of moves Cand D (for an N-game rollout of each move).

Instead of rolling out all four moves N times, this new feature wouldrollout A and B approximately 16 (i.e. 4*4) times more frequently than Cand D, resulting in approximately equal standard errors for each move atthe conclusion of the rollout (and even during the rollout, based on thealgorithms below).

For a single thread, the algorithm works as follows (for the example ofrolling out 4 moves):


1. Rollout A, B, C, and D two times each.

2. Check the standard errors of each move. Rollout one additional game ofthe move with the highest standard error.

3. Repeat step 2 until the rollout is complete.

Note that "set rollout trials" could correspond to (1) the number of timesto rollout move A (the first play selected), (2) the average number ofrolled out games per play, (3) the maximum number of games to rollout eachplay, or something else. To avoid the possibility of unexpected longrollouts (i.e. if move A has extremely low standard errors), I suggestchoosing (2) or (3).

For multiple threads, the algorithm is similar: when a thread needs a newgame to rollout, assign it to rollout the move which currently has thehighest standard error.

If all of the above results in too much overhead (I doubt this is the case,but I suppose it's possible), then the single-thread algorithm could becarried out in batches of 36 games:


1. Rollout A, B, C, and D 36 times each.

2. Check the standard errors of each move. Rollout 36 additional games ofthe move with the highest standard error.

3. Repeat step 2 until the rollout is complete.

(The multiple-thread algorithm would have to be tweaked a little, but theidea is essentially the same.)


Chris

[Prev in Thread]

Current Thread

[Next in Thread]

[Bug-gnubg] Feature request: Rollout frequency based on std. errors, Christopher Yep <=
- Re: [Bug-gnubg] Feature request: Rollout frequency based on std. errors, Christopher Yep, 2009/08/31

Prev by Date: Re: [Bug-gnubg] Attempt to speed up single threaded use in multi threaded build
Next by Date: Re: [Bug-gnubg] Feature request: Rollout frequency based on std. errors
Previous by thread: [Bug-gnubg] Attempt to speed up single threaded use in multi threaded build
Next by thread: Re: [Bug-gnubg] Feature request: Rollout frequency based on std. errors
Index(es):
- Date
- Thread