A brief history of XML – From hype to useful data format

Vadim Mytarev October 18, 2016

Is XML really dead? When it first became popular about 20 years ago, XML was meant to be the one and only format to serialize, encapsulate, and exchange data: the serialization format to end all serialization formats, so to speak. This was a bold claim. Has it materialised? Over the last couple of years it ...

Take the pain out of XML processing on Spark.

Maciek Kocon September 8, 2016

Note: We have written an updated version of this post that shows XML conversion on Spark to Parquet with code samples. Did you ever have to process XML files? Complex and large ones? Lots of them? No matter which processing framework or programming language you use, it is always a pain. It is never easy. It ...

Big Data News: Convergence with MapR and Faster Stateful Streaming Processes with Spark

Uli Bethke February 5, 2016

MapR on Impedance Mismatch and how convergence is achieved for layered architecture, along with Databricks on using the new Spark API “mapWithState” for faster Stateful Spark Streaming. As our big data world comes to the end of another week, the team at Sonra have once again been impressed by the week's highlights in big data. ...

Big Data News: App Development with “JSON” and an Open Source “Spark” Library for Geospatial Analysis

Uli Bethke October 23, 2015

Faster Big Data App Development with the Open Source JSON UI called “OJAI” and how the Spark Library “Magellan” will come to the rescue in Geospatial Analysis. As the week moves closer to an end, the team at Sonra have been impressed with the developments reviewed, which positively reflects the direction our community is headed ...
