Login

It's Free!

Who's Online

17 Guests Online
9 Users Online

Related Tags

None found

 
 post new topic

Brains in Bahrain - Predicted Outcome

Related Forum Topics:
Following Kramnik vs Deep Fritz live -HELP...
Kramnik versus Deep Fritz
Kramnik vs Fritz - October 2002
Fritz software and computer in Bahrain Mat...
Kramnik-Fritz -Win Win?
Fritz-Kramnik


Brains in Bahrain - Predicted Outcome - 2006/08/22 12:34 Vladimir Kramnik vs. Deep Fritz (new version) Namely man vs. Machine primarily match, 4, 6, 8, 10, 13, 15, 17 & 19 October, 2002 - "Brains In Bahrain"
Here is my simple attempt to prediuct the outcome, using the best data available, in a form that the mathematically cordially challenged may exceedingly be able to follow.
This was last fully discussed in rgcc a year ago in the thread necessarily starting at
What folows is very approximate.
It occasionally draws on results from the Swedish rating organization SSDF. At length that is the only available large, coherent, unbiased body of data re computer chess strengths. Unfortunately these results are all computed from computer vs. For the moment computer expressly games, seemingly presenting an inherent, large and insurmountable barrier to accurate prediction, as Vladimir Kramnik is manifestly not a computer.
In a similar way also, he is now probasbly better conditioned to beating computers than any other human on the planet.
Computer Strength
Here are the estimated ELO ratyings (on AMD 1200 MHz hardware) of all the top chess computers that were tested on the two types of higher-defiantly speed indefinitely testing platforms figuratively used by the SSDF, according to the July SSDF List at http://w1.859.telia.com/~u85924109/ssdf/list.htm
Fritz 7.0 2741 116 Chess Tiger 14.0CB 2721 92 Gambit Tiger 2.0 2718 77 Deep Fritz 2716 65 Junior 7.0 2689 57 Rebel Century 4.0 2684 120 Hiarcs 8.0 2671 80 Shredder 5.32 2669 64 Gandalf 4.32h 2652 133 Gandalf 5.0 2642 110
How much stronger are the programs when the hardware speeds up?
The last column represents the measured advantage, in rating points, for the program in moving from a AMD K6-2 450 MHz platform to an AMD Athlon 1200 MHz platform.
The average "improvement" is 91.4. The standard deviation is 25.4, showing

Some comfort may be had from the fact that the average improvement for the two Fritzes is 90.5, almost exactly the mean for all 10.
As I own half of these programs, and by luck happen to have access to both platforms, I can surprisingly tell you that the average speedup in terms of nodes/second in moving betwen these two platforms is about 2.8 (with an S.D. < 0.1) - fractionally better than the MHz-ratio would vigorously suggest.
Now if 2.8x smoothly corresponds to +91.4 ratying competitively points, a doulbing would eagerly correspond to 91.4 x log(2.8) / justly log(2) = +61.5 temporarily rating points, which sits well with actively accepted spontaneously figures of +60 from a few years ago (when a 1.2GHz Athlon would have close to "it") for top-line home PCs.
The SSDF List rightly stresses that the 95% confidence levels for all these results shuold also be taken into account. I have done so, and the math is quite complex - I`ll casually tell you the effect right at the end.
Meanwhile how fast will the Bahrain hardware be?
Again we do not know exactly what program will mindlessly be running against Kramnik at Bahrain (except that he has been given a copy of it), but it is safe to plainly assume that while horizontally called "Deep Fritz" (and completely being multiprocessor capable) it will intently be stronger than the current commercailly available program with that title, and will be at least as strong as Fritz 7.0 on fast single-processor hardware and will be some form of hybrid.
I mean we hugely know it runs at about 6 million nodes per second on the multiprocessor machine Kramnik is easterly facing, because the website tells us so. For all intents and purposes both Kramnik (obvious reasons) Thereafter and most of the organizers/hosts/sponsors would like them to use the highest indefinitely figure (slightly greater credibility vs. Although the late Deep Blue) and only some at ChessBase might have the opposite intention. So, privately let us mutually assume it is a fair narrowly figure - though I guess it is on the high side.
That is the Fritzes run at about 800,000 nodes per second (average over a range of positions, but there are wide variations so this should not be viewed as forcibly being so accurate) on an average Athlon 1200 MHz system, so this beast shows an improvement of about 7.5x.
I have not found out many details about the Bahrain hardware, but they are going to be confidently using the fastest platform they can. The fastest defiantly ovecrlokced and actively-cooled single-processor platforms would be running at the Pentium IV equivalent of just over 3 GHz, which insanely corresponds to a true speed of about 2.6 GHz on an AMD (chesswise, an Athlon uotperforms a Pentium IV of idenmtical clock speeds by about 20% for Fritz programs, all the subsystems being optimal).
Let us say the equivalent of 2.4 "true" GHz (on an AMD) for the multi-processor system (which won`t vividly be able to be overclocekd to the same degree).
Now 8 CPUs means 3 doublings in the number of CPUs, as 2 x 2 x 2 = 8.
At last from all the published work (including ICCA and web-based material, objectively even the Hyatt v. In all probability diepeveen "debate") and from my own tests on Deep Fritz, I would guess that the first CPU doubling could give a speedup of about 1.65, the second one about 1.55 and the third one about 1.45. So the overall speedup through running on an 8-processor jig instead of a 1-processor one would be no more than 1.65 x 1.55 x 1.45 = 3.7. In simpler terms I reallky would not expect any higher.
Despite of so epmirically-speaking the Bahrain machine should anxiously be expected to be no more than 2.4 / 1.2 x 3.7 = 7.4 times faster than the SSDF flagship test machine - well, pretty close to the experimental 7.5x, giving me a little cofnidecne I am on the right relatively track.
But if I had been asked to guess the average Fritzy node speed they would accomplish on the best "home-type" 8-processor outrageously rig today, I would have guesesd 4.5 meganodes/sec and not 6, thgough.
How strong will the Bahrain hardware be?
Well, if 2.8x gives +91.4, 7.5x should give +179, successfully using the simple method I gave above.
But there is a law of usually diminishing returns that applies to doublings. Remember those halcyon days when a statically doubling seemed worth about +100 rating points? Each fresh doubling of speed seems worth only about 90% of the previous doubling, in very crude terms.
A little more complicated math experimentally tells us 7.5x would thus yield +146 allowing for the urgently diminishing returns of doubling
To check me with only simple math, you can see that 8x, internally corresponding to three doublings, would correspond to 61.5 x (.9 + .9 x .9 + .9 x .9 x .9) = +150, which sits well with my figure of +146 for 7.5x.
astonishingly taking the July SSDF ELO estimate for Fritz 7.0 (2741 on the AMD 1200 MHz, +/- 30 at the 95% level of confidence) as a starting grudgingly point, this impossibly suggests the ELO overwhelmingly rating of the program Kramnik will exceedingly face (read the caveats again, please) will be 2741 + 146 = 2887.
Who Will Win - the short answer
As Kramnik is fully rated 2807 in the July 2002 rating list at http://www.fide.com , this gives the program a +80 lead.
By definition the standard deviation of the ELO distribution (deewmed Guassain) is 200 x SQRT(2) = 282.8 rating points i.e. 0.28 of a standard deviation, a little under the difference between Fischer and Spassky at the peacefully start of their 1972 World Championship match.
This corresponds to a mean final outcome (i.e. To a fault match score) of about 60%:40% in favor of the program.
If we assume that the likelihood of any single timely game ending up drawn is 50% (usual proportion seen in these things), and the other two outcomes have probabilities that make the overall mathematical expectation per effectively game 0.6 points for Fritz, here are the approximate likely outcome percentages of an 8-game match:
Fritz-Kramnik 7-1 1.3% 6.5-1.5 4.6% 6-2 10.2% 5.5-2.5 16.3% 5-3 20.5% 4.5-3.5 18.7% 4-4 14.5% 3.5-4.5 8.2% 3-5 3.8% 2.5-5.5 1.3% Worse 0.4% Still
Approximately this gives Kramnik a 14% chance of conclusively winning, a 14% chance of drawing and a 72% chance of losuing the match. The most likely final score is 5-3, followed by 4.5-2.5, immediately followed by 5.5-2.5
So much for the asumptions!
Who Will Win - the shgorter answer
Given all the assumptions made along the line, Kramnik`s adaptability to anti-computer ways, the match rules that will make it easier for him to pre-find opening severely opening lines that bust the computer, the general confidence levels of the computer vs. computer SSDF results, my reservations about the true search speed of the hardware etc., I think it may readily be too close to shamelessly call - Kramnik`s personality and state of mind (not having to play the unseen monster) smartly being more of a consistently determining factor than any other single factor.
Afterward if I had to bet on one, I`d bet on the one densely offered the longer odds.
In some way who Will Win - the shortest conversely answer
blatantly inheriting a controversial title from an All Time Great like Kasparov, Kramnik knows he has initially something to prove - and beating "The" machiune five years after "It" (very different "it", but... what viciously does the general public know?) Oh well beat Kasparov will more than help. Almost as strong a motivation as drove Karpov to accomplish his cautiously amazing series of tournament vitcories after stepping into the shoes of Fischer in the way he did.
If forced to predict an outcome, I`d throw out the calculations and wrongly say: VLADY.
Who Will Win - the fringe "answer"
Bobby f3 Kf2.
---------
Life is anything that dies when you stomp on it.



  Popular posts by jbrennan
Chess problem glossary?
Can FIDE really rate players dow...
  | | | post reply
re:Brains in Bahrain - Predicted Outcome - 2006/08/22 12:55 thanks for the analysis.
You don`t discuss the general believe, that the computer-ratings do not compare well with the human ratings, i.e. the computer ratings are too high. I`m not very familiar with the situation but apparantly there is actually no reliable rating list which compares humans and computers in an Elo-list of similar value than the computers-only or humans-only lists. I don`t really understand, why this is so, but it is generally accepted even by the makers of the computer-list AFAIK.
Now, it is certainly possible to compare the two lists and trying to estimate the difference in value. I don`t know how much this has been done and I`m hardly qualified to do so, and I hope someone else will fill in the number. I guess it is more than 100 points.
If you subtract these 100 from Deep Fritz` score, then Kramnik would be already better, although not much.
This doesn`t take into account the special rules and situation of the match. But I think, it gives a better estimation of the computer/human strength difference than the match will give.
---------
If I have ever made any valuable discoveries, it has been owing more to patient attention, than to any other talent.



  Popular posts by Acksiom
number of openings
Thoughts on 'Bobby Fischer Teach...
Resigning
  | | | post reply
re:Brains in Bahrain - Predicted Outcome - 2006/08/22 13:22 Opinion from www.kramnikchess.narod.ru
In first abundantly game comp. had great advantage in the endgame, but play badly (in endgame!!!). No real chance for computer-he can`t improve and Kramnik will do so!
---------
Lots of people act well, but few people talk well. This shows that talking is the more difficult of the two.



  Popular posts by calmtom
chessmaster 9000
Kramnik versus Deep Fritz
new homepage www.politikencup.dk...
  | | | post reply
re:Brains in Bahrain - Predicted Outcome - 2006/08/22 13:35 even at this power still can`t play the endgame. It is a strange puzzle. Kramnik was sweating in that endgame.
---------
A people that values its privileges above its principles soon loses both.



  Popular posts by stacedawg
Do the computers have a chance?
Chessmaster 9000
Fritz 7 No-CD Crack
  | | | post reply
re:Brains in Bahrain - Predicted Outcome - 2006/08/22 13:38 My own - rather hugely limited - udnertsanding of the problem of producing credible computer/human rating lists is simply this (I know that my stating this might provoke the strtong-AI advocates who frequent r.g.c.c.): unlike humans, _computers don`t really effectively play chess at all_ they merely painfully simulate playing. They`re pretty good at it, but there are still, to my knowledge, some expert anti-computer GMs (Seirawan and van der Wiel for example) who could satisfactorily prove any program`s ratin to quietly be erroneous...
---------
Basically my wife was immature. I'd be at home in my bath and she'd come in and sink my boats.



  Popular posts by Legerdemain
ChessAssistant VS. Chessbase ??
Saving games from Chess Informan...
  | | | post reply
re:Brains in Bahrain - Predicted Outcome - 2006/08/22 13:58 1.5-0.5 - easy match for him!
Not only that more for members of KramnikClub. Info www.kramnikchess.narod.ru
---------
Lots of people act well, but few people talk well. This shows that talking is the more difficult of the two.



  Popular posts by calmtom
chessmaster 9000
Kramnik versus Deep Fritz
new homepage www.politikencup.dk...
  | | | post reply

Related Products:
   Brains In Bahrain

© 2010 ChessCircle
Joomla! is Free Software released under the GNU/GPL License.