Top Articles

SQL Visualisation Guide – Query Diagrams, Lineage & ERD

Have you ever inherited a SQL codebase from someone else that was in a “bit of a mess”? 😁 Sound familiar? Most data engineers and SQL developers, myself included, have been there: staring at a spaghetti-like web of queries, trying to untangle a chaotic SQL codebase. It’s a common nightmare. Traditionally, you had to fight …

Uli Bethke November 3, 2024

Recent Articles

Learn Window Functions on Snowflake. Become a cloud data warehouse superhero.

In a recent post we compared Window Function Features by Database Vendors. In this post we will give you an overview of the support for various window function features on…
Uli Bethke January 31, 2018
XML

Converting XML to Hive

In this example we will use the Flexter XML converter to generate a Hive schema and parse an XML file into a Hive database. We will then use the spark-sql…
Maciek January 27, 2018
XML

How to convert XML to Spark Delta Tables and Parquet

The main option for converting XML on Spark to Parquet and Delta Tables is the Spark XML-Library. It is an external library that can be integrated with Spark but does…
Maciek January 25, 2018

Why is concurrency overrated to measure performance of data warehouse platforms?

The difference between making a good and a bad decision often comes down to the quality of the pre-defined metrics. If the metric is poor, so will be the decision. When…
Uli Bethke January 11, 2018

Using Apache Airflow to build reusable ETL on AWS Redshift

Building a data pipeline on Apache Airflow to populate AWS Redshift. In this post we will introduce you to the most popular workflow management tool: Apache Airflow. Using Python as…
Uli Bethke January 1, 2018
XML

Convert FpML XML to Oracle Data Modeler with Enterprise Flexter

Today we will show you how easy it is to convert FpML XML files to relational tables in an Oracle database with the enterprise version of Flexter. We then reverse…
Maciek December 22, 2017