bug-gnubg
[Top][All Lists]
Advanced

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: [Bug-gnubg] How strong is the lookahead?


From: boomslang
Subject: Re: [Bug-gnubg] How strong is the lookahead?
Date: Mon, 30 Mar 2009 16:35:11 +0000 (GMT)

Hi Erez,


About three years ago I let GNUBG play 26000 one pointers on FIBS at two 
settings: Expert and Supremo. Based on its wins and losses, I estimated the 
FIBS rating of the 2 settings using Maximum Likelihood. The results were:


 Chequer play     Point      95% Confidence
   setting       estimate      interval
--------------------------------------------
   expert         2034        (2009, 2059)
   supremo        2098        (2055, 2142)


A difference of about 65 FIBS points (based on 1 pointers).

Greetings, boomslang

(And yes, even 0ply plays 2000+ level...)



--- On Mon, 30/3/09, Christian Anthon <address@hidden> wrote:

> From: Christian Anthon <address@hidden>
> Subject: Re: [Bug-gnubg] How strong is the lookahead?
> To: "Erez" <address@hidden>
> Cc: address@hidden
> Date: Monday, 30 March, 2009, 1:25 PM
> 2009/3/30 Erez <address@hidden>:
> > I wanted to know what is the true addition of the
> lookahead to GNU so as a
> > true lamer I set out a money game session between GNU
> playing expert and GNU
> > playing Supremo. I thought to let them play until 64
> points and see Supremo
> > mop the floor with expert.
> > I was never able to get to 64 because the software
> crashed somewhere in the
> > middle (happened again on another computer), but still
> I was very surprised
> > to find out that Supremo got only about 20%
> more points than Expert.
> > So how much stronger really is Supremo than Expert?
> >
> 
> Not much I expect. The strength of the higher plies are
> mostly seen in
> specific situations where the look-ahead is important. It
> is more a
> question of reliability than overall strength. 64 points
> certainly
> won't be enough to determine the difference with any
> confidence. The
> best approach is probably to let  0ply play 0ply for a long
> money
> session. Have supremo analyse the games for both players,
> and then
> rollout any disagreements.  Then you'll have error
> rates to compare
> for the two playing strengths.
> 
> Christian.
> 
> > Erez
> > _______________________________________________
> > Bug-gnubg mailing list
> > address@hidden
> > http://lists.gnu.org/mailman/listinfo/bug-gnubg
> >
> >
> 
> 
> _______________________________________________
> Bug-gnubg mailing list
> address@hidden
> http://lists.gnu.org/mailman/listinfo/bug-gnubg







reply via email to

[Prev in Thread] Current Thread [Next in Thread]