Self-Service Analytics in the Data Lake

Uli Bethke Big Data, Business Intelligence, Data Warehouse, Hadoop, MapR, Spark

Deriving Value from your data... We all know that decision makers and analysts need quick access to all of the structured and unstructured data in an enterprise. Typically this data is locked away in various source systems and can’t be queried from a single central location The data warehouse set out to fix this problem ages ago and it has ...

Window Functions (aka Analytic Functions) in Spark.

Uli Bethke analytic functions, Big Data, MapR, Spark, SparkSQL, SQL

As of Spark 1.4.0 we now have support for window functions (aka analytic functions) in SparkSQL. At Sonra we are heavy users of SparkSQL to handle data transformations for structured data. We also use it in combination with cached RDDs and Tableau for business intelligence and visual analytics. Spark SQL and Window Functions: The rationale I am a strong supporter ...

Data Warehousing in the Age of Big Data. RDBMS Scalability, Exploding Data Volumes and License Costs.

Uli Bethke Big Data, Data Warehouse, Hadoop, MapR

Note: There is an updated version of this post. You can download it for free. Download updated version In the first part of this series of blog posts on data warehousing in the era of big data we looked at limitations of relational databases for data warehousing. In this post I will elaborate further on the impact of growing data ...

In-memory analytics with Tableau, SparkSQL, and MapR

Uli Bethke Big Data, Hadoop, Hive, MapR, Spark, SparkSQL, Tableau

Last week Tableau released version 9.0 of their data visualisation tool. From a Big Data point of view the nicest new feature was support for querying cached (in-memory) SchemaRDDs (Data Frames as of Spark 1.3). In this tutorial I will show you how to connect to Spark 1.2.1 on the MapR 4.1 sandbox with Tableau 9.0. Pre-requisites: - Download the ...