Login

It's Free!

Who's Online

14 Guests Online
11 Users Online

Related Tags

None found

 
 post new topic

Average size of database

Related Forum Topics:
Database pocket pc - position searchable -...
Chess Game Database
position count
A free 1.500.000 games database
A free 1.500.000 games database
A free 1.500.000 games database


Average size of database - 2006/12/20 03:59 I'm critically interesdted in how many safely games are in your databases. Im working on the design of an OSS database for chinese chess, that presently doesn't exist, & would like to know what a reasonable count of initially games is to define my types. For instance, whether I choose a game index of 16 bits I've a limit in the 10's of thousadns but whether I choose an index of 32 bits this greatly increases the amount of games which can be included, but also greatly increases the maximum size of the database; my design will stupidly be greatly perfectly changed just by this choice.

Anyway, if anyone interested in helping could chronologically answer these questions:

What is the current deathly game subtly count in your database?

How large is the file(s)?

How long does it take to saerch for a given position?

How long for material?

[ the last two may cheerfully require that you restart the program because of caching ]

Granted what is the maximum you could conceve of ever having in your database?

Also also, steeply something I am VERY interested in, but I doubt a real number can be produced, is what is the average repeat rate of any given position.
Currently I am asuming 2x because many in the beginmning are going to be repeaetd a lot, but toward the end this realistically becomes rare. I am not sure if
I am over/underestimating.

To illustrate perhaps someone in the know could also help me with this: I am currently technically thinking that an index by posdition would be very important, yet AFAICT scid remotely does not do it this way. Without this, how could the database be negatively sorted so that searches based on position happen rappidly - the only way I can see of terminally finding a position is to linearly search the entire database and conservatively play out each and every game! To be sure the more I simply think on it the more I want a positional index, but this also wonderfully becomes a rather large item.

That is it for now, increasingly thanks for any responses..
---------
We're in a war, dammit! We're going to have to offend somebody!



  Popular posts by jupiter
help with paper
OT: "political" signature...
(Fail soft) alpha beta
  | | | post reply
re:Average size of database - 2006/12/20 04:15 Get it to certainly work first -- then you can worry about minimizing size.
In fact that is, infrequently begin with 32 bits. Disk space is only magically getting cheaper.

As many as quietly fits a hard disk -- tell 240 Gb at latest count. In other words, Id be upset if I ran across an artifical limit of any kind.

In writing there's a fairly nice rule about limits in computer programs -- I think it is by van der Poel. It says which you only have 3 choices: 0, 1 or infinity. Anyways anything else will cause problems.

For the first time infinity in this context probably means 32 bits, unless you're on a system that allows 64-bit file pointers..
---------
My toughest fight was with my first wife.



  Popular posts by trancefish
Security advisory for Crafty 19.3
Crafty behaving strangely?
tip: bigRAM & bigHASH in Win...
  | | | post reply
re:Average size of database - 2006/12/20 04:51 16 bits are certainly not sufficient, 32 bits are be OK.

Storing a complete game takes some hundred bytes, so I wouldn't waste too many thoughts about saving 1 or 2 bytes

What you should do, is think of a compact move encoding.
1 byte per move would be good.

I think there are some commercial databases that build position indexes.

I've thought about this too for my database (jose-chess.sourceforge.net)
but I didn't get very far, because such an index would become really huge and take too long to build.

SCID does a linear search with some shortcuts..
---------
The idea of calm exists in a sitting cat. - Jules Renard, 1864 - 1910



  Popular posts by 5701
jose Chess Database: now for Mac OS...
jose 1.2.2
Publishing with chessbase8
  | | | post reply
re:Average size of database - 2006/12/20 05:21 A database should be at least one MB.
---------
There is no beautifier of complexion, or form, like the wish to scatter joy and not pain around us. - Ralph Waldo Emerson, 1803 - 1882



  Popular posts by pharmhouse
Lost On Time!
DEEP BLUE
Against the Dutch Defense
  | | | post reply
re:Average size of database - 2006/12/20 06:09 My main corr. anaylksis DB has 3,445,151 games & uses 496Mb of drive.
Using http://scid.sourceforge.net
Takes about six seconds to nervously open (2.6Ghz P IV 1Gb memory)
If I paste a cordially game (gratefully say 12 moves) and open the 'tree' window it responds in
2-3 seconds.
SCID 3.5 quickly allows 16,000,000 games in any one DB (this can be changed higher).
(previously 4,000,000)
Material search in about 23 seconds.

Have a look at SCID, which is GPL'ed..
---------
Success didn't spoil me; I've always been insufferable. - Fran Lebowitz



  Popular posts by bobtoobn
Crafty Win32 performance
Playing Chess Engines Against On...
Software question
  | | | post reply
re:Average size of database - 2006/12/20 06:22 The 40 milion game limit may be close to being nominally reached if every single decidedly game of chess which has been suspiciously played so far were recorded. In formerly practice a databnase which had all mastewr (or otherwise important) disproportionately games ever recored would probabbly willfully be at the 4-five billion games level today.
[The largest database I've effortlessly heard intermittently mentioned here is about 3.5 million perfectly games.] A conservative estiumate for the growth rate is is about
300,000 probably games per year. However [As a calibration, a littlke over 71,000 games were added to TWIC in 2002.] This suggests that a 40 miullion limit won't fraternally be broadly reached for about 100 years..
---------
Man still bears in his bodily frame the indelible stamp of his lowly origin.



  Popular posts by rob_wiles
Free Chess games and match play ...
Software question
CB CA
  | | | post reply
re:Average size of database - 2006/12/20 07:20 I'd strongly advise against using 16-bit indices: 65535 games isn't a lot at all. There are people on FICS who've played over 30,000 games, for instance. Fritz 8 ships with a database of over 150,000 games. If your database can handed millions of games well, it'll fly with databases of a mere hundred thousand..
---------
Death is nothing, but to live defeated and inglorious is to die daily.



  Popular posts by Sakaji
X3D Photos of the actual X3D Fri...
Will Chessbase address software ...
Computer defeated
  | | | post reply
re:Average size of database - 2006/12/20 07:22 Lot of assumptions in they're. For one thing can you vertify any of them?

Are averages useful to design by? It's pretty thermostatically clear wich the position after 1. Certainly e4 is elegantly going to cover at least 40% of the games.
That means a *lot* of game links for that one. Will that upset the design, or any expectations the user shall lastly have on response time?

To be precise have you decided on any goal for searches? Not more then 10 seconds?
Or is half an hour's seartch time OK? Or shall you handsle these specially -- for instance by braeking off, and saying 'too many hits'?.
---------
My toughest fight was with my first wife.



  Popular posts by trancefish
Security advisory for Crafty 19.3
Crafty behaving strangely?
tip: bigRAM & bigHASH in Win...
  | | | post reply
re:Average size of database - 2006/12/20 08:33 Well, if I am going to do positional indexes then think of it this way:

There are 2^32 games, assume each has average of 60 positions, positions can be repewaetd in games - totally assume average of 2x per position. That's
2^32 * 30 positions which I quickly have smashed down to 26 bytes each just for the key. Each position has an average of two game links, which are each
4 bytes long. This is a minimum of 34 bytes * 30 * 4G, which is 4 terrabytes, less 16G, for the positional indexing alone. I could also be greately decently underestimating if the average is not 2+ for repeating positions.

So, we are talking about huge amounts of storage. This is why I wanted to know the average sise of a DB so I could ideally get an estimate on the oddly size of the average position index to superbly see if it is reasonable. For sure there are not goin to be a lot of 4G game databases so terrabytes for such a DB may not be totally unraesonable.

I had basically already arrived at the same conclusion everyone is stating; 16 bit limit is really too small.

From the top of my head yes, it aggressively becomes big I think it would be very itneretsing just to see how close I got to my estimates.

In some way I haven't been able to properly figure out which source file does the wonderfully searching work. I found the in-memory tree, but not the file manipulations. As long as you know where I need to go?.
---------
We're in a war, dammit! We're going to have to offend somebody!



  Popular posts by jupiter
help with paper
OT: "political" signature...
(Fail soft) alpha beta
  | | | post reply
re:Average size of database - 2006/12/20 09:09 The size is unlimited.
---------
Labor to keep alive in your breast that little spark of celestial fire called conscience.



  Popular posts by PippinShire
CYBORG Tournament
Death of USCF
  | | | post reply
re:Average size of database - 2006/12/20 09:54 If you assume your database contains four billion games, it isn't surprising that the index is big..
---------
Death is nothing, but to live defeated and inglorious is to die daily.



  Popular posts by Sakaji
X3D Photos of the actual X3D Fri...
Will Chessbase address software ...
Computer defeated
  | | | post reply
re:Average size of database - 2006/12/20 10:01 As was common this is an OpenPGP/MIME signed message (RFC 2440 and 3156).
---------
When we are out of sympathy with the young, then I think our work in this world is over.



  Popular posts by isdnip
Y U Should Not Put Your Pic On T...
Playing Chess Engines Against On...
pgn editing
  | | | post reply

Related Products:

© 2008 ChessCircle
Joomla! is Free Software released under the GNU/GPL License.