The Week in Chess

Saturday, May 31, 2014

Komodo 7a x64 vs. Stockfish, Houdini, Gull, Strelka - Gauntlet 100 Rounds

The TCEC Live Tournament Season 6 was finally over a couple of days ago and the undisputed winner is Stockfish with 7 points advantage in 64 games against Komodo 7x. The rating list sites are now updating their lists to show the current standings specially that the official Stockfish 5 which is almost similar in strength with the entry at TCEC was released a few hours ago. Some sites like sedatcanbaz.com and this site were showing Stockfish as the number 1 even before the TCEC Season 6 began. Other sites like the popular CCRL still has Houdini 4 Pro as the number 1 which was a clear loser in TCEC, ahead of the latest Komodo 7a the, runner up.

White waiting for the ongoing Stockfish 5 gauntlet matches to finish in a few days, I decided to publish my private tests of Komodo 7a done earlier against the top 4 strongest chess engines like Stockfish, Houdini, Gull and Strelka. This was to see whether the narrow ELO advantage of Komodo 7a against  Houdini 4 Pro will still hold with more games played.

The test was done in the same AMD quad core computer used in previous tests with the same tournament conditions of 1 minute base + 1 second increment time control in 100 round gauntlets done 4 times.

In the final result, the ELO ranking was maintained but the ranking while the gauntlet tournaments were in progress were not consistent for Komodo, Houdini and Gull but Stockfish was number 1 and Strelka was number 5 throughout the 4 batches. Only in the 4th batch that the ranking eventually aligned with the latest Owl Rating List. The ELO rating gap between Komodo 7a and Houdini 4 was just mere 5 ELO points which is statistically very close that it could change easily with more games.

Here is the estimated ELO strength of the contestants and their score statistics:

Rank Engine Est. ELO Raw Elo Games Score% Points Win Loss Draw Chg
1Stockfish 14051110 x64 3173.9885.7140061.13 244.5151621875.14
2Komodo 7a x64 3114.416.20160051.13 818.04313957740.36
3Houdini 4 Pro x64 3109.201.1540048.75 195.08797216-0.02
4Gull 3 x64 3083.82-10.3840047.75 191.0861042102.79
5Strelka 6 3063.85-82.6740037.88 151.571168161-3.95
.

Here are the 4 gauntlet batches that tells the performance story:

Komodo 7a x64 vs. Stockfish, Houdini, Gull, Strelka - 100RR 1M1S Batch 1
RankEngineScoreKo
1Komodo 7a x64 208.0/400· ·· ·· ··
2Stockfish 14051110 x6456.5/10032-19-49
3Houdini 4 Pro x64 51.0/10020-18-62
4Gull 3 x64 47.0/10022-28-50
5Strelka 6 37.5/10018-43-39



Komodo 7a x64 vs. Stockfish, Houdini, Gull, Strelka - 100RR 1M1S Batch 2
RankEngineScoreKo
1Komodo 7a x64 206.5/400· ·· ·· ··
2Stockfish 14051110 x6462.0/10037-13-50
3Gull 3 x64 50.5/10024-23-53
4Houdini 4 Pro x64 42.0/10014-30-56
5Strelka 6 39.0/10015-37-48


Komodo 7a x64 vs. Stockfish, Houdini, Gull, Strelka - 100RR 1M1S Batch 3
RankEngineScoreKo
1Komodo 7a x64 203.5/400· ·· ·· ··
2Stockfish 14051110 x6467.0/10046-12-42
3Gull 3 x64 48.5/10022-25-53
4Houdini 4 Pro x64 48.0/10024-28-48
5Strelka 6 33.0/10013-47-40


Komodo 7a x64 vs. Stockfish, Houdini, Gull, Strelka - 100RR 1M1S Batch 4
RankEngineScoreKo
1Komodo 7a x64 200.0/400· ·· ·· ··
2Stockfish 14051110 x6459.0/10036-18-46
3Houdini 4 Pro x64 54.0/10029-21-50
4Gull 3 x64 45.0/10018-28-54
5Strelka 6 42.0/10025-41-34


400 games played / Tournament finished

Tournament start: 2014.05.27, 08:40:07
Latest update: 2014.05.30, 16:51:10
Level: Blitz 1/1
Hardware: AMD Phenom(tm) II X4 945 Processor with 1.8 GB Memory
Operating system: Windows 7 Ultimate Professional Service Pack 1 (Build 7601) 64 bit
Table created with: Arena 3.5

Download the gauntlet matches PGN games here.
.

2 comments:

Chessdom News