Big Data News: Yahoo’s Data Sketching and Apache Spark 1.6

Uli Bethke Apache, Big Data, Community, Data Science, Hadoop, Hive, Open Source Software, Technology

Launching 2016 in style, from an exploration of Yahoo’s successful scaling of aggregate computational queries using data sketching libraries to Apache Spark releasing Spark 1.6. Firstly, the team at Sonra would like to wish you and yours every success in 2016. As the arrow of time pushes us forward, our Big Data industry is forging ahead in a cycle of ...

Big Data News: Open Source with HUG Ireland with Google & Zalando

Uli Bethke Big Data, Hadoop, HUG Ireland, Technology

Hadoop User Group (HUG) Ireland’s latest event, Google open sourced its AI “TensorFlow”, and Zalando has a new “vision statement” for its Open Source “Guild”. Last week was once again a busy week at Sonra, starting with a packed house at Bank of Ireland, Grand Canal Square, last Monday, where HUG Ireland held a meetup event for our big data ...

Big Data News: HyperLogLog with Spark and Open Source GZinga Compression

Uli Bethke Big Data, Spark, Technology

Exploring the performance enhancements of HyperLogLog on Spark and adding splittable and seekable features to Gzip in a new open source project called GZinga. Life is never dull in big data, and as I left a great Spark Dublin meetup last night pondering the distributed performance enhancements of using DataFrames in Spark, I was once again struck by the continuous ...

Big Data News - Where advanced “Machine Learning” meets “Fine Wine”

Uli Bethke Big Data, Data Science, Technology

A week in review: Tableau’s first M&A venture, Machine Learning meets Fine Wine, and a Streaming World beyond Batch Processing. This has been another week of movement around the world, with Tableau stepping into the M&A club with its first acquisition. We explore how a UCL academic and former wine trader developed machine learning algorithms to increase ...