Big Data News: HyperLogLog with Spark and Open Source GZinga Compression

Uli Bethke Big Data, Spark, Technology

Exploring the performance enhancements of HyperLogLog on Spark and adding splittable and seekable features to Gzip in a new open source project called GZinga. Life is never dull in big data, and as I left a great Spark Dublin meetup last night pondering the distributed performance enhancements of using DataFrames in Spark, I was once again struck by the continuous ...

Data discovery and the unfulfilled promises of self-service Business Intelligence

Uli Bethke Big Data, Business Intelligence, Data Discovery, ETL, Hadoop

Self-service BI is all about allowing business users or non-technical staff to generate insights from data. It is about to make its breakthrough in the enterprise! Happy days. Unfortunately, this has been the headline prediction for the last ten years. The hype has never really materialised. So why has self-service BI struggled so much to gain traction in the enterprise? ...

Two Use Cases of Data Discovery Tools

Uli Bethke Big Data, Business Intelligence, Data Discovery, Hadoop

Data discovery tools are a relatively recent phenomenon, as can be seen from the fact that there is no separate Gartner Magic Quadrant for them. Data discovery tools allow users without programming skills to wrangle and transform raw data. There are two main use cases for this type of tool: (1) Self-service Business Intelligence. Self-service BI promises business users without ...

Big Data News - Innovative Innovations on Hadoop by Twitter & LinkedIn...

Uli Bethke Big Data, DFS, Hadoop

From Twitter’s innovative Namespace Design to LinkedIn’s Gobblin 0.5.0 release, which now includes Apache Kafka integration. The team at Sonra have been impressed by the developments reviewed through the week, and the social media giants once again come to the forefront for their work on Hadoop, which bears mention for its insightful design responses to problems experienced at ...

Big Data News - Predict Conference 2015… A Premier Big Data event!

Uli Bethke Big Data, Cloud, Data Science, Technology

From Predict Conference 2015 to Data Mining and Innovation at Twitter. There is no doubt that before I went to represent Sonra at Predict Conference 2015, which was organised by Creme Global, I was a little dubious about the efficacy of the conference along with its outcomes for both Sonra and HUG Ireland. Would it allow us to effectively deliver ...