Greenplum, MapReduce, and Hadoop

June 18, 2009

If your job involves processing massive amounts of data you should familiarize yourself with Greenplum, MapReduce, and Hadoop.
With 6.5 Petabytes of data eBay runs the world’s largest data warehouse on Greenplum. Facebook runs a 2 PB warehouse on Hadoop. Impressive.
Both Greenplum and Hadoop make use of the MapReduce framework pioneered by Google.
You can run Hadoop on Amazon Elastic MapReduce to play around with the technology.
There have also been two Hadoop books published recently. I have ordered both of them and can’t wait to hold them in my hands.
Hadoop: The Definitive Guide
Pro Hadoop
No books on Greenplum, but they have some good whitepapers on their website.