SQL on Hadoop, BigQuery, or Exadata. Please don’t call them MPP.

I often hear people referring to SQL engines running against HDFS or object storage as MPP. Strictly speaking this is incorrect. Let me first explain what an MPP database is…
Uli Bethke May 10, 2018

You might be surprised to hear, but Hadoop is a poor choice for a data lake

As there are a lot of definitions on what constitutes a data lake let’s first define what we actually mean by it. A data lake is similar to the staging…
Uli Bethke May 3, 2018

A brief history of XML – From hype to useful data format

Is XML really dead? When it first became popular about 20 years ago, XML was meant to be the one and only format to serialize, encapsulate, and exchange data. The…
Vadim Mytarev October 18, 2016

Big Data News: HUG Ireland’s 1st 2016 Big Data Event, Airbnb’s Predictive Model using NPS and Hive Optimization

Hadoop User Group (HUG) Ireland packed the house with a great evening on Apache Mesos/Myriad and an overview of Airbnb’s Predictive Model After a restful holiday season, the new year…
Uli Bethke January 15, 2016

Big Data News: Yahoo’s Data Sketching and Apache Spark 1.6

Launching 2016 in style with an exploration of Yahoo’s successful scaling of aggregate computational queries using data sketching libraries to Apache Spark releasing Spark 1.6 Firstly, the team at Sonra…
Uli Bethke January 8, 2016

Big Data News: Hadoop User Group (HUG) Ireland’s morning briefing and MapR’s new Product Launch called “Streams”

HUG Ireland’s great morning briefing with Alexey Grishchenko and MapR is now directly competing with Apache Kafka Last week was a busy one in operations for the Sonra team with…
Uli Bethke December 11, 2015
spinner