Big Data News: HUG Ireland’s 1st 2016 Big Data Event, Airbnb’s Predictive Model using NPS and Hive Optimization
Hadoop User Group (HUG) Ireland packed the house with a great evening on Apache Mesos/Myriad and an overview of Airbnb’s Predictive Model After a restful holiday season, the new year kicked off in style for Hadoop User Group (HUG) Ireland with its opening 2016 event at Synchronoss on January 11th. We heard from Mary Mangru, ...
Read MoreBig Data News: Yahoo’s Data Sketching and Apache Spark 1.6
Launching 2016 in style with an exploration of Yahoo’s successful scaling of aggregate computational queries using data sketching libraries to Apache Spark releasing Spark 1.6 Firstly, the team at Sonra would like to wish you and yours every success in 2016. As the arrow of time pushes us forward, our Big Data industry is forging ...
Read MoreIn-memory analytics with Tableau, SparkSQL, and MapR
Last week Tableau released version 9.0 of their data visualisation tool. From a Big Data point of view the nicest new feature was support for querying cached (in-memory) SchemaRDDs (Data Frames as of Spark 1.3). [big_data_promotion] In this tutorial I will show you how to connect to Spark 1.2.1 on the MapR 4.1 sandbox with ...
Read MoreUsing parameters and variables in Hive CLI
In this blog post we look at how we can address a shortcoming in the Hive ALTER TABLE statement using parameters and variables in the Hive CLI (Hive 0.13 was used). There’s a simple way to query Hive parameter values directly from CLI You simply execute (without specifying the value to be set): SET <parameter>; ...
Read More