Data Warehousing in the age of Big Data. The end of an era?

Uli Bethke Big Data, Data Warehouse, Hadoop

In this series of blog posts we will explore how the age of Big Data is changing the data warehouse landscape for good. There are a multitude of issues that today's data warehouse implementations face. What are the steps that you as a CIO or Data Warehouse executive can take to implement the next generation of data warehouses. Part II: ...

Multiple Spark Worker Instances on a single Node. Why more of less is more than less.

Uli Bethke Big Data, Hadoop, Spark

If you are running Spark in standalone mode on memory rich nodes it can be beneficial to have multiple worker instances on the same node as a very large heap size has two disadvantages: - Garbage collector pauses can hurt throughput of Spark jobs. - Heap size of >32 GB can't use CompressedOoops. So 35 GB is actually less than ...

In-memory analytics with Tableau, SparkSQL, and MapR

Uli Bethke Big Data, Hadoop, Hive, MapR, Spark, SparkSQL, Tableau

Last week Tableau released version 9.0 of their data visualisation tool. From a Big Data point of view the nicest new feature was support for querying cached (in-memory) SchemaRDDs (Data Frames as of Spark 1.3). In this tutorial I will show you how to connect to Spark 1.2.1 on the MapR 4.1 sandbox with Tableau 9.0. Pre-requisites: - Download the ...

Big Data Made Easy. The Sonra Hadoop Quick Start Appliance

Uli Bethke Big Data, Hadoop, MapR, Supermicro

Big Data: Huge cost savings and revenue opportunities, but daunting Hadoop is a core component of the modern data platform. It is not only up to 25 times less expensive per TB than traditional solutions but also opens up new revenue streams. However, Big Data can be daunting. Businesses need to acquire expertise in a lot of areas to be ...