Big Data News: HUG Ireland’s 1st 2016 Big Data Event, Airbnb’s Predictive Model using NPS and Hive Optimization

Uli Bethke January 15, 2016

Hadoop User Group (HUG) Ireland packed the house with a great evening on Apache Mesos/Myriad and an overview of Airbnb’s Predictive Model After a restful holiday season, the new year kicked off in style for Hadoop User Group (HUG) Ireland with its opening 2016 event at Synchronoss on January 11th. We heard from Mary Mangru, ...

Read More

Big Data News: Yahoo’s Data Sketching and Apache Spark 1.6

Uli Bethke January 8, 2016

Launching 2016 in style with an exploration of Yahoo’s successful scaling of aggregate computational queries using data sketching libraries to Apache Spark releasing Spark 1.6 Firstly, the team at Sonra would like to wish you and yours every success in 2016. As the arrow of time pushes us forward, our Big Data industry is forging ...

Read More

In-memory analytics with Tableau, SparkSQL, and MapR

Uli Bethke April 16, 2015

Last week Tableau released version 9.0 of their data visualisation tool. From a Big Data point of view the nicest new feature was support for querying cached (in-memory) SchemaRDDs (Data Frames as of Spark 1.3). [big_data_promotion] In this tutorial I will show you how to connect to Spark 1.2.1 on the MapR 4.1 sandbox with ...

Read More

Using parameters and variables in Hive CLI

Maciek Kocon February 19, 2015

In this blog post we look at how we can address a shortcoming in the Hive ALTER TABLE statement using parameters and variables in the Hive CLI (Hive 0.13 was used). There’s a simple way to query Hive parameter values directly from CLI You simply execute (without specifying the value to be set): SET <parameter>; ...

Read More