Converting complex XML to CSV

Maciek Kocon XML

Have you ever tried to process data in XML? Did you have to load its values into a relational database or simply convert it to plain CSV? Before I show you a simple way of processing complex XML files, let’s first think out loud about what XML is actually used for. Note: If you have a good grasp of XML specifics you ...

Querying XML files with Tableau via Flexter

Uli Bethke Tableau, XML, XSD

Tutorial - Querying XML files with Tableau. Tableau is a great visualisation tool that works with a large number of different file and data formats. One format that it does not support is XML. There are some workarounds, e.g. using Excel to flatten the XML file. However, with deeply nested files or XMLs with a complex schema this ...

A brief history of XML - From hype to useful data format

Vadim Mytarev Flexter, Hadoop, Spark, XML, XSD

Is XML really dead? When it first became popular about 20 years ago, XML was meant to be the one and only format to serialize, encapsulate, and exchange data: the serialization format to end all serialization formats, so to speak. This was a bold claim. Has it materialised? Over the last couple of years it has become clear that this ...

Take the pain out of XML processing on Spark.

Maciek Kocon Big Data, Spark, XML

Note: We have written an updated version of this post that shows XML conversion on Spark to Parquet with code samples. Did you ever have to process XML files? Complex and large ones? Lots of them? No matter which processing framework or programming language you use, it is always a pain. It is never easy. You can be sure that it ...