Big Data News: Convergence with Mapr and Faster Stateful Streaming Processes with Spark

Mapr on Impedance Mismatch and how convergence is achieved for layered architecture along with Databricks on using the new Spark API “mapWithState” for faster Stateful Spark Streaming As our big…
Uli Bethke February 5, 2016

Big Data News: Apps Development with “JSON” and an Open Source “Spark” Library for Geospatial Analysis

Faster Big Data Apps Developments with Open Source JSON UI called “OJAI” and how the Spark Library “Magellan” will come to the rescue in Geo Spatial Analysis As the week…
Uli Bethke October 23, 2015

Big Data News: HyperLogLog with Spark and Open Source GZinga Compression

Exploring the performance enhancements of HyperLogLog on Spark and adding splittable and seekable features to Gzip in a new open source project called GZinga Life is never dull in big…
Uli Bethke October 16, 2015

Big Data News – MapR’s Deep Dive into Apache Drill and Gartner’s Big Data Predictions for 2020

30% of all Enterprises will use intermediaries for big data by 2017 Another week has passed and our big data community has been busy around the world. Some interesting movements in…
Uli Bethke September 4, 2015

Self-Service Analytics in the Data Lake

Deriving Value from your data... We all know that decision makers and analysts need quick access to all of the structured and unstructured data in an enterprise. Typically this data…
fsd_admin July 23, 2015

Window Functions (aka Analytic Functions) in Spark.

As of Spark 1.4.0 we now have support for window functions (aka analytic functions) in SparkSQL. At Sonra we are heavy users of SparkSQL to handle data transformations for structured…
Uli Bethke July 3, 2015
spinner