Wheeee. Converting is finished.
28,910,224 hands are in the database, with 163,373 players.
The PT3-database is 155.34 GB, the old PT2-databases were 102 GB.
Converting started at 175 hands/second on this machine. During the last database it still managed to do 100 hands/second. To compare: PT2 did 25 hands/second here in an empty database, and 15 hands/second in a 1 million hand-database. Overall, PT3 seems to have a 600% speed increase.

Sometimes I ran a query before and after converting a database. I noticed a few times that exactly 100 hands were missing in the new database.
The error I reported in this thread last week happened several times. According to the hand-counts in the database, the hands were normally imported, while the error said the hands were not imported.
There are a lot of nicks which appear a few time in the playerlist. Some of them even 10-12 times. As far as I can tell, this only happens to nicks with at least 1 non-alphabetical character in it.
Like I said yesterday, PT3 finds hands that were purged in PT2. This is a cool feature.

When you try to look at all the tabs of a player with a LOT of hands (like 300,000), the "vs Player"-tab is sorta worthless. It can take a lot of time before all the data is found in the database.
Is there a different approach to clustering now, compared to the 1st time I attempted to build a huge kickass database?
I tried it with PT3beta3 and PostgreSQL 8.2.x the 1st time, and clustering temporarily blew up the database to twice it's size. Now, with PT3beta4 and PostgreSQL 8.3, it appears that clustering is done in chunks. This means PostgreSQL doesn't need all the diskspace it needed before. I don't think my database was ever over 200 GB while clustering.
The database is big, but looking up players is still quite fast. For players with >200K hands things get slower, but it's still acceptable. Just don't try to get info from the "vs Player"-tab. That's the only tab which is very slow.
Last time I clustered the database, it needed 9 hours. This is a lot.
I had PT3 convert a database a few days ago before I went to bed, and I had "cluster on completion" checked. When I woke up, it was all finished. After that, I had it convert a smaller database which would be finished before I had to leave for work. Unfortunately I forgot to uncheck the cluster checkbox. Clustering in progress clustering can't/shouldn't be stopped, so this meant that the database was basically going to be idle untill I got home from work. Such a waste of time

maybe something needs to be done with the "Cluster on completion" checkbox.
Today I tried creating a backup in PostgreSQL. This failed several times. Within 10 minutes the backup was aborted:
[code4vk]pg_dump: reading indexes for table "settings"
pg_dump: reading constraints
pg_dump: reading triggers
pg_dump: reading dependency data
pg_dump: saving encoding = SQL_ASCII
pg_dump: saving standard_conforming_strings = off
pg_dump: saving database definition
pg_dump: dumping contents of table holdem_hand_histories
pg_dump: Dumping the contents of table "holdem_hand_histories" failed: PQgetCopyData() failed.
pg_dump: Error message from server: could not receive data from server: No buffer space available (0x00002747/10055)
pg_dump: The command was: COPY public.holdem_hand_histories (id_hand, history) TO stdout;
pg_dump: *** aborted because of error[/code4vk]
So there's no way for me to backup the database... Right now, I don't care about the database, but it would be nice if I could backup such a monster in the future

I'll PM the server/port/ip/password info to both Josh and APerfect 10... I won't do a lot to the database for a few days, so go ahead and do evil things in it.

The database will be there untill I have to rebuild it for beta5. I'll add some more hands on a daily basis.
I didn't have a lot of time yet, so I couldn't try the database (after it got huge) over an internet connection.