Apache Spark Quickstart Packages

Uli Bethke May 15, 2017

We are pleased to announce three Apache Spark Quickstart Packages. The packages are designed for companies that want to explore and evaluate Apache Spark. Example Use Cases: The quickstart packages can be used for various scenarios. I have listed some use cases below. You would like to evaluate a certain Spark feature and identify its ...


Window Functions (aka Analytic Functions) in Spark

Uli Bethke July 3, 2015

As of Spark 1.4.0, Spark SQL supports window functions (aka analytic functions). At Sonra we are heavy users of Spark SQL for data transformations on structured data. We also use it in combination with cached RDDs and Tableau for business intelligence and visual analytics. Spark SQL and Window Functions: The ...


In-memory analytics with Tableau, SparkSQL, and MapR

Uli Bethke April 16, 2015

Last week Tableau released version 9.0 of its data visualisation tool. From a Big Data point of view, the nicest new feature was support for querying cached (in-memory) SchemaRDDs (DataFrames as of Spark 1.3). In this tutorial I will show you how to connect to Spark 1.2.1 on the MapR 4.1 sandbox with ...
