The Week in Chess

Tuesday, April 30, 2013

Stockfish 130427 and 130429 w32 m10i pgo - Download

A user has requested a 32bit version of Stockfish 130427 x64 that beat Houdini 3 Pro during test.

Here is the link.

There is no warranty nor support. Beware that the latest build versions of Stockfish crashed and has regression. This compilation is best used against Moron 1.0. Guaranteed.

Enjoy.


Stockfish 130429 Tests

The latest build of Stockfish dated April 29, 2013 was released.

Change log:
Author: Marco Costalba
Date: Mon Apr 29 00:55:32 2013 +0200
Timestamp: 1367189732

Temporary revert "Expose EvalInfo struct to search"

It is not needed for the release and introduces
a slowdown, although very small.

Probably it will be readded after the release.

No functional change.

-------------

I noticed that the past few releases since April 25 contained some annoying updates.  The release on April 25 just before the TCEC final crashed on my computer. I was about to complain in one of the forums when I saw similar experience already posted by some users. It got fixed swiftly but the resulting tests were discouraging when it was thrashed by the previous build. Then another update came with this log:

Author: Marco Costalba
Date: Thu Apr 25 21:52:26 2013 +0200
Timestamp: 1366919546


Store Eval::Info in Search::Stack

Instead of a pointer. This should fix the issue of
remaining with a stale pointer when for instance calling
IID, but also null search verification, singular search
and razoring where we call search with the same ss
pointer. In this case ss->ei is overwritten in the
search() call and upon returning remains stale.

This patch could have a performance hit because Eval::Info
is big (176 bytes) and during splitting we copy 4 ss entries.

On the good side, this patch is a clean solution.

Proposed by Gary.

No functional change.  


------------------------------

I think that the last changes were something not to be proud about. The crash should not happen if it was thoroughly tested by the Stockfish team.  It was so embarrassing that I suspect there was little quality testing procedures that were followed. And then the regression that was known beforehand was published and then quickly reverted. Oh! my God...

Despite of this, I continued to test the latest version but no longer sure if it will be consistent with my previous tests. It was some kind of 100 round robin games between my 64-bit compilation with the April 29 build against the 64-bit auto-build at abrok.eu/stockfish and Houdini 3 Pro x64.


Stockfish 130429 x64 m10i pgo - Compiled with VS 2010 with full optimization, PGO, IPP, MKL and TBB, beat Houdini 3 Pro by just 1 point and Stockfish 13042900 x64 with 10 points.

Stockfish 13042900 x64 - Auto-built with MinGW by abrok.eu/stockfish lost to Houdini 3 Pro and Stockfish 130429 x64.

Houdini 3 Pro - Compiled with VS 2010 by +Robert Houdart won over Stockfish 13042900 x64 with 8 points but lost to Stockfish 130429 x64 with 1 point.

Here is the crosstable:

-->
Engines 1 2 3 Points Win Loss Draw
Stockfish 130429 x64 m10i pgo xxx 35-34-31 33-23-44 105.5 68 57 75
Houdini 3 Pro x64 34-35-31 xxx 38-30-32 103.5 72 65 63
Stockfish 13042900 x64 23-33-44 30-38-32 xxx 91.0 53 71 76

These are run in AMD Quad Core with 4GB RAM, Windows 7 64bit virtualized under PcLinuxOS 2013.  The results are just for testing and not intended for serious statistics.

Download the winning engine Stockfish 130429 x64 here.

Download computer chess engines tournament games here.

Sunday, April 28, 2013

Naum 4.2 x64 - Gauntlet Matches, 100 Rounds

Naum 4.2 x64 by +Alexandar Naumov scored 25.56% with 176 wins, 1056 draws and 568 losses in the 100 round gauntlet matches against the strongest computer chess engines.  The event is Category 32 with average ELO Rating of 3040.  Naum's score was lopsided as it hit the bottom of the field with a big gap in between that is filled by the stronger 32-bit engines in the overall rating list.

One noticeable change is the diluted ELO rating of Houdini 3 each time that there is a new gauntlet added, from a high ELO rating of 3215 down to 3197.

Here is the result: 
Rank Engine ELO Raw Games Score% Points Win Loss Draw Chg
1 Houdini 3 Pro x64 3197 132 100 87.50% 87.5 77 2 21 -1
2 Stockfish 2.3.1 x64 3105 106 100 85.00% 85.0 74 4 22 1
3 Robodini 1.1 x64 3095 74 100 80.50% 80.5 71 10 19 1
4 Rybka 4.1 x64 3033 54 100 80.50% 80.5 66 5 29 3
5 Critter 1.6a x64 3101 45 100 78.00% 78.0 67 11 22 0
6 Ivanhoe 46h x64 3053 34 100 77.50% 77.5 64 9 27 0
7 Strelka 5.5 x64 3047 5 100 74.50% 74.5 59 10 31 0
8 Tactico Power 2011 x64 3047 3 100 74.50% 74.5 58 9 33 0
9 RobboLito 0.10 x64 3060 3 100 74.50% 74.5 58 9 33 -1
10 LEOpard 0.7c x64 3054 3 100 75.00% 75.0 57 7 36 -1
11 Komodo 5 x64 3043 -1 100 74.00% 74.0 57 9 34 0
12 Fire 2.2 xTreme x64 3057 -6 100 73.50% 73.5 56 9 35 -1
13 Ippolit 0.080b3 x64 2976 -31 100 69.50% 69.5 53 14 33 2
14 Vitruvius 1.11C x64 3026 -32 100 71.00% 71.0 50 8 42 -1
15 Saros 3.3b x64 3001 -32 100 70.00% 70.0 50 10 40 0
16 Firenzina 2.2.2 xTreme x64 3041 -37 100 68.50% 68.5 52 15 33 -2
17 Bouquet 1.6 x64 3031 -52 100 67.50% 67.5 46 11 43 -2
18 Gull R375 x64 2938 -107 100 58.50% 58.5 41 24 35 -1
19 Naum 4.2 x64 2868 -161 1800 25.56% 460.0 176 1056 568 2868
.
Download the computer chess engines tournament games here.

Owl Computer Chess Engines Rating List #37

The 37th Owl Computer Chess Engines Rating List released.  Naum 4.2 x64 by +Alexander Naumov was the catalyst of change scoring 25.56% with 176 wins, 1056 losses and 568 draws in the 100 rounds gauntlet matches against the strongest computer chess engines. It placed last among the field with a big ELO points gap.

View the full rating list here.

Stockfish 130427 beat Houdini 3, Again!...

To beat Houdini 3 in a long match seems not an impossibility.  I had a quad core AMD where Houdini 3 is beaten by Stockfish development versions twice in 100 games matches successively. It is also the same computer mostly used where Houdini 3 topped the Owl Chess Engines Rating List with 94 ELO points margin over Stockfish, the #2 in rank.

These two chess giants are currently locked in a 48 game battle with long time control at +Martin Thoresen's live TCEC-chess.net superfinal. As of this writing, Houdini leads with just 1 point in 7 games where 6 games were draws.   Houdini's extreme strength seems to be blunted with the longer time control and the use of 16 core computer. This brings to mind one of the early lessons I learned, that "chess games played correctly will end in draws".   So, with the long time control and super processing resources, there is a big probability that the chess engines will always find the correct move most of the time, therefore draws will happen frequently.  The longer the time control and the use of super computers will produce seemingly "dull" games between two chess engines with almost even strength.

And here is another speculation that Houdini will be beaten by Stockfish:

Stockfish 130427 Tests
RankEngineScoreSt
1Stockfish 130427 x64 m10i pgo112.0/200· ·· ·· ··
2Houdini 3 Pro x64 44.5/10030-41-29
3Stockfish 130419 x64 msc pgo 43.5/10011-24-65


200 games played / Tournament finished

Tournament start: 2013.04.27, 17:09:01
Latest update: 2013.04.28, 03:18:41
Level: Blitz 0:10/1
Hardware: AMD Phenom(tm) II X4 945 Processor 3000 MHz with 4 GB Memory
Operating system: Microsoft Windows XP 64 Bit Professional Service Pack 2 (Build 3790)
Table created with: Arena 3.0

Stockfish 130427 x64 m10i pgo was compiled the same date of release on April 27, 2013 with Visual Studio 2010 using Profile Guided Optimization and Intel IPP, MKL, TBB libraries.

Stockfish 130419 was built similary with the same build options above and was the chess engine that beat Houdini 3 the first time.  This time it was beaten by the latest Stockfish 130427 build with 13 points deficit.

Houdini 3 Pro lost to Stockfish 130427 with 11 points difference.  This is another  random result that should not be taken very seriously.  Chess engines do not behave consistently in any computer I've used, so the results may also be different when other testers will try the compilation that I produce.

Download the Stockfish 130427 x64 m10i pgo chess engine that beat Houdini 3 here.

Download the computer chess engines games tournament test here.

Saturday, April 27, 2013

Gull R375 Gauntlet Matches, 100 Rounds

Gull R375 by +Vadim Demichev scored  33.0% with 201 wins, 779 losses and 720 draws against the strongest chess engines on earth. The field was so strong that Gull placed last in the gauntlet tournament, Category 33 event with average ELO rating of 3050. Stockfish 2.3.1 x64 was the top performer followed by Fire with 4 ELO points gained each while Houdini 3 pro was in number 5 and lost 4 ELO points. This event marked a milestone when Stockfish 2.3.1 x64 is now in the 2nd rank of the Owl Computer Chess Engines Rating List.


Here is the statistics:  
Rank Engine ELO Raw Games Score% Points Win Loss Draw Chg
1 Stockfish 2.3.1 x64 3104 111 100 78.00% 78.0 69 13 18 4
2 Fire 2.2 xTreme x64 3058 87 100 78.00% 78.0 62 6 32 4
3 Robodini 1.1 x64 3094 76 100 76.00% 76.0 61 9 30 2
4 Strelka 5.5 x64 3047 43 100 73.00% 73.0 53 7 40 3
5 Houdini 3 Pro x64 3198 40 100 71.00% 71.0 56 14 30 -4
6 Komodo 5 x64 3043 37 100 72.00% 72.0 53 9 38 3
7 Bouquet 1.6 x64 3033 33 100 72.00% 72.0 51 7 42 4
8 LEOpard 0.7c x64 3055 20 100 70.50% 70.5 47 6 47 2
9 Ivanhoe 46h x64 3053 17 100 70.00% 70.0 46 6 48 2
10 Critter 1.6a x64 3101 0 100 66.50% 66.5 46 13 41 -2
11 Vitruvius 1.11C x64 3027 -21 100 64.00% 64.0 39 11 50 1
12 Rybka 4.1 x64 3030 -26 100 61.00% 61.0 47 25 28 0
13 Firenzina 2.2.2 xTreme x64 3043 -36 100 62.00% 62.0 36 12 52 -1
14 Ippolit 0.080b3 x64 2974 -63 100 57.50% 57.5 27 12 61 1
15 Tactico Power 2011 x64 3047 -69 100 56.50% 56.5 28 15 57 -4
16 Saros 3.3b x64 3001 -69 100 56.00% 56.0 29 17 54 -1
17 RobboLito 0.10 x64 3061 -75 100 55.00% 55.0 29 19 52 -4
18 Gull R375 x64 2939 -104 1700 33.00% 561.0 201 779 720 2939
.
Download computer chess engines tournament games here.

Owl Computer Chess Engines Rating List #36

The 36th Owl Computer Chess Engines Rating List released.  The gauntlet matches of Gull R375 and Dirty 20Apr2013 were the primary cause of the changes. Gull's performance against the strongest 17 chess engines was worth 2939 ELO points, while Dirty's effort against the lower Top Chess Engines was worth 2571.

View the full rating list here.

Dirty 20Apr2013 - Gauntlet Match 50R

Dirty 20Apr2013 by +Pradu Kannan +Andres Valverde +Fonzy Bluemers scored 58.55% with 440 wins, 269 losses and 291 draws in the 50 rounds gauntlet tournament for entrance to the Owl Rating List. The event is a Category 11 with average ELO rating of 2509 which was topped by Jonny 4.00 and Dirty was on the 4th place.  The performance of Dirty was worth 2571 ELO points.

Here is the scoreboard:


Rank Engine ELO Raw Games Score% Points Win Loss Draw Chg
1 Jonny 4.00 2555 102 50 57.00% 28.5 21 14 15 2
2 Gambit Fruit 4bx 2559 95 50 56.00% 28.0 21 15 14 1
3 BugChess2 1.9 2551 93 50 56.00% 28.0 20 14 16 1
4 Dirty 20Apr2013 2571 57 1000 58.55% 585.5 440 269 291 2571
5 SmarThink 1.20 2558 42 50 48.00% 24.0 18 20 12 0
6 DiscoCheck 4.0.1 2529 36 50 47.00% 23.5 14 17 19 0
7 Twisted Logic 100131 2557 25 50 45.00% 22.5 14 19 17 -1
8 EXchess 6.71b 2523 24 50 45.00% 22.5 14 19 17 -2
9 Colossus 2008b 2468 22 50 45.00% 22.5 15 20 15 -2
10 Delfi 5.4 2495 18 50 44.00% 22.0 13 19 18 -2
11 Bright 0.5c 2530 17 50 44.00% 22.0 15 21 14 0
12 Arasan 15.2 2483 3 50 42.00% 21.0 15 23 12 -1
13 Bison 9.11 2512 0 50 41.00% 20.5 12 21 17 -1
14 Pharaon 3.5.1 2479 -25 50 38.00% 19.0 11 23 16 -5
15 Alaric 707 2479 -26 50 38.00% 19.0 12 24 14 -3
16 Rainbow Serpent 2.3.2 2490 -40 50 37.00% 18.5 15 28 7 -3
17 ProDeo 1.83c 2482 -51 50 33.00% 16.5 7 24 19 -5
18 Zappa Mexico II 2437 -76 50 30.00% 15.0 7 27 16 -1
19 Fruitfly 1.1.1 2506 -86 50 30.00% 15.0 9 29 12 -5
20 The King 3.50 2482 -89 50 30.00% 15.0 10 30 10 -4
21 Ruffian 1.0.5 2448 -140 50 23.00% 11.5 6 33 11 -7
.
Download computer chess engines tournament games here.

Chessdom News