Jeff Dean (google) talked recently at the UW about BigTable.
Data in a single table is internally broken at arbitrary row boundaries to form contiguous regions of data called tablets. These tablets are distributed across a large pool of worker machines. The system is designed to manage several petabytes of data distributed across thousands of machines, with very high update and read request rates coming from thousands of simultaneous clients.
Andrew Hitchcock has a detailed review of the above talk.