Take the pain out of XML processing on Spark.

Note: We have written an updated version of this post that shows XML conversion on Spark to Parquet with code samples. Did you ever have to process XML files? Complex…
Maciek Kocon September 8, 2016

From Big to Smart Data in 5 Simple Steps

Are you the same as me? Tired of hearing buzzwords such as "mining gold in your data", "finding valuable nuggets of information" or the ever-present "actionable insights"? One of…
Uli Bethke April 5, 2016

Plot.ly in Dataiku DSS Web App

Dataiku DSS and Plot.ly JS (Plot.ly in DSS Web App). What is Plot.ly JS? Plot.ly JS is a high-level charting library built on top of D3.js. It ships with 20…
Vadim Mytarev March 15, 2016

Data Science & Data Discovery Platforms Compared. Datameer and Dataiku DSS go head to head.

Overview: We recently performed an evaluation of various data science and data discovery platforms for one of our clients. We looked in detail at Dataiku Data Science Studio (DSS) and…
Uli Bethke February 12, 2016

Big Data News: Convergence with MapR and Faster Stateful Streaming Processes with Spark

MapR on impedance mismatch and how convergence is achieved for a layered architecture, along with Databricks on using the new Spark API “mapWithState” for faster stateful Spark Streaming. As our big…
Uli Bethke February 5, 2016

Big Data News: Streaming in the Extreme. An Evolution in Data Processing and Analytics

Google has submitted a project proposal to open source Dataflow through the Apache Software Foundation, along with MapR on streaming across data centers. As another week comes to a…
Uli Bethke January 29, 2016