Top Articles

SQL Visualisation Guide – Query Diagrams, Lineage & ERD

Have you ever inherited the SQL codebase from someone else where it is in a “bit of a mess”? 😁 Sound familiar? Most data engineers and SQL developers have been there including myself—staring at a spaghetti-like web of queries, trying to untangle a chaotic SQL codebase. It’s a common nightmare. Traditionally, you had to fight …

Uli Bethke November 3, 2024

Recent Articles

Rocketship Performance for Snowflake with Clustering Keys

Introduction Snowflake stores tables by dividing their rows across multiple micro-partitions (horizontal partitioning). Each micro-partition automatically gathers metadata about all rows stored in it such as the range of values…
Uli Bethke February 8, 2018

Create your own custom aggregate (UDAF) and window functions in Snowflake

In this post we will show you how to create your own aggregate functions in Snowflake cloud data warehouse. This type of feature is known as a user defined aggregate…
Uli Bethke February 4, 2018

Learn Window Functions on Snowflake. Become a cloud data warehouse superhero.

In a recent post we compared Window Function Features by Database Vendors. In this post we will give you an overview on the support for various window function features on…
Uli Bethke January 31, 2018
XML,

Converting XML to Hive

In this example we will use the Flexter XML converter to generate a Hive schema and parse an XML file into a Hive database. We will then use the spark-sql…
Maciek January 27, 2018
XML,

How to convert XML to Parquet with Spark

Converting XML to Parquet on Spark might seem straightforward, until you dive in. For years, the go-to approach has been the external spark-xml library: functional, flexible… and a bit hands-on.…
Maciek January 25, 2018

Why is concurrency overrated to measure performance of data warehouse platforms?

The difference between making a good and a bad decisions often comes down to the quality of the pre-defined metrics. If the metric is poor so will be the decision.When…
Uli Bethke January 11, 2018
spinner