[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [Bug-gnubg] Huge evaluation difference
From: |
Philippe Michel |
Subject: |
Re: [Bug-gnubg] Huge evaluation difference |
Date: |
Tue, 19 Mar 2019 22:23:27 +0100 |
User-agent: |
Mutt/1.11.4 (2019-03-13) |
On Mon, Mar 18, 2019 at 10:25:52AM +0100, Terje Pedersen wrote:
> I just ran into a huge evaluation difference between XG and Gnu BG:
>
> XGID=-B----CFC----A-------acbb-:1:1:1:16:0:0:0:5:10
>
> I got dinged with a -0.684 error here while XG says it is a -0.0013
> error to play 13/6.
>
> Desktop version says it is:
>
> 7. Cubeful 0-ply 13/6 Eq.: -2,264 ( -0,328)
> 0,000 0,000 0,000 - 1,000 0,964 0,000
> 0-ply cubeful prune [expert]
>
> but still wildly off. Perhaps difficult to fix these kind of problems?
The position is of a kind that was probably not common in the racing
neural net training database. X's formation just doesn't happen in real
play, does it ?
GNU Backgammon Position ID: 2wUAAAb3OwgAAA
Match ID : UQmnAAAAAAAE
+13-14-15-16-17-18------19-20-21-22-23-24-+ O: GNUbg
| X | | O O O O | OO 0 points
| | | O O O | OO
| | | O | O
| | | | O
| | | | O
v| |BAR| | 5 point match
| 6 | | |
| X | | |
| X X | | X |
| X X | | X X | Rolled 61
| X X | | X X | 0 points
+12-11-10--9--8--7-------6--5--4--3--2--1-+ X: You (Cube: 2)
Pip counts : O 19, X 99
"Fixing" this at the neural net level doesn't look easy. Adding a few
hundreds or thousands of similar positions in the training data would
probably help but it is difficult to guess in advance if the improvement
would be significant.
It wouldn't change the program's strength (it doesn't get into these
positions) but may avoid or at least mitigate this kind of embarrassing
result when analysing.
What you can do if it is important for you (for the analysis feature of
Backgamon Studio, I suppose) is to use the largest one-sided bearoff
database, available there :
ftp://ftp.demon.nl/pub/Museum/Demon/games/gnubg/databases/os/
If the download size is an issue, you can compute it yourself with the
makebearoff utility from gnubg ; os13.bd (with the back checker up to
the midpoint, as in your position) should take about one day, smaller
ones would be much faster.
With it, I get :
1. Cubeful 2-ply 13/6 Eq.: -2.201
0.000 0.000 0.000 - 1.000 0.913 0.000
2-ply cubeful prune [world class]
2. Cubeful 2-ply 8/2 7/6 Eq.: -2.204 (-0.003)
0.000 0.000 0.000 - 1.000 0.915 0.000
2-ply cubeful prune [world class]
3. Cubeful 2-ply 7/6 7/1 Eq.: -2.213 (-0.013)
0.000 0.000 0.000 - 1.000 0.923 0.000
2-ply cubeful prune [world class]
4. Cubeful 2-ply 13/12 8/2 Eq.: -2.234 (-0.034)
0.000 0.000 0.000 - 1.000 0.939 0.000
2-ply cubeful prune [world class]
5. Cubeful 2-ply 13/12 7/1 Eq.: -2.240 (-0.039)
0.000 0.000 0.000 - 1.000 0.943 0.000
2-ply cubeful prune [world class]
6. Cubeful 2-ply 13/7 6/5 Eq.: -2.294 (-0.093)
0.000 0.000 0.000 - 1.000 0.984 0.000
2-ply cubeful prune [world class]
It may seem radical to use a 1.5 Gb database just for this but, besides
the disk space needed, it is essentially free. The file is not read, it
is merely mapped into memory and the relevant blocks are read only when
needed.
- [Bug-gnubg] Huge evaluation difference, Terje Pedersen, 2019/03/18
- Re: [Bug-gnubg] Huge evaluation difference,
Philippe Michel <=
- Re: [Bug-gnubg] Huge evaluation difference, Øystein Schønning-Johansen, 2019/03/20
- Re: [Bug-gnubg] Huge evaluation difference, Philippe Michel, 2019/03/20
- Re: [Bug-gnubg] Huge evaluation difference, Øystein Schønning-Johansen, 2019/03/21
- Re: [Bug-gnubg] Huge evaluation difference, Philippe Michel, 2019/03/24
- Re: [Bug-gnubg] Huge evaluation difference, Øystein Schønning-Johansen, 2019/03/24
- [Bug-gnubg] Large bearoff databases (was: Huge evaluation difference), Philippe Michel, 2019/03/24
- Re: [Bug-gnubg] Large bearoff databases (was: Huge evaluation difference), Øystein Schønning-Johansen, 2019/03/24
- Re: [Bug-gnubg] Large bearoff databases (was: Huge evaluation difference), Øystein Schønning-Johansen, 2019/03/24