Big Data News: Convergence with MapR and Faster Stateful Streaming Processes with Spark

Uli Bethke Big Data, DFS, HUG Ireland, MapR, Spark

MapR on impedance mismatch and how convergence is achieved in a layered architecture, along with Databricks on using the new Spark API “mapWithState” for faster stateful Spark Streaming

As our big data world comes to the end of another week, the team at Sonra have once again been impressed by the week's highlights in big data. MapR has shared its insights ...

Big Data News: Apps Development with “JSON” and an Open Source “Spark” Library for Geospatial Analysis

Uli Bethke Business Intelligence, Data Discovery, MapR, Spark

Faster big data app development with the open source JSON interface “OJAI”, and how the Spark library “Magellan” comes to the rescue in geospatial analysis

As the week draws to a close, the team at Sonra have been impressed with the developments reviewed, which reflect positively on the direction our community is headed in. The increasing use of ...

Big Data News: HyperLogLog with Spark and Open Source GZinga Compression

Uli Bethke Big Data, Spark, Technology

Exploring the performance enhancements of HyperLogLog on Spark and adding splittable and seekable features to Gzip in a new open source project called GZinga

Life is never dull in big data, and as I left a great Spark Dublin meetup last night pondering the distributed performance enhancements of using DataFrames in Spark, I was once again struck by the continuous ...

Big Data News – MapR’s Deep Dive into Apache Drill and Gartner’s Big Data Predictions for 2020

Uli Bethke Big Data, Hadoop, Spark, SQL for Analysis

30% of all enterprises will use intermediaries for big data by 2017

Another week has passed and our big data community has been busy around the world. Some interesting movements in our industry have arisen, with Doug Laney on Forbes making three predictions on the advancement of big data to 2020, including the rise of third-party big data contractors helping ...

Self-Service Analytics in the Data Lake

Uli Bethke Big Data, Business Intelligence, Data Warehouse, Hadoop, MapR, Spark

Deriving value from your data...

We all know that decision makers and analysts need quick access to all of the structured and unstructured data in an enterprise. Typically this data is locked away in various source systems and can’t be queried from a single central location. The data warehouse set out to fix this problem ages ago and it has ...