Top Articles

SQL Visualisation Guide – Query Diagrams, Lineage & ERD

Have you ever inherited a SQL codebase from someone else that was in “a bit of a mess”? 😁 Sound familiar? Most data engineers and SQL developers, myself included, have been there: staring at a spaghetti-like web of queries, trying to untangle a chaotic SQL codebase. It’s a common nightmare. Traditionally, you had to fight …

Uli Bethke November 3, 2024

Recent Articles

The top 10+1 things we love about Snowflake

Introduction I have been familiarising myself with Snowflake over the last couple of months, and these are my impressions of the top 10+1 features that really make Snowflake stand out…
Uli Bethke February 14, 2018

Rocketship Performance for Snowflake with Clustering Keys

Introduction Snowflake stores tables by dividing their rows across multiple micro-partitions (horizontal partitioning). Each micro-partition automatically gathers metadata about all rows stored in it such as the range of values…
Uli Bethke February 8, 2018

Create your own custom aggregate (UDAF) and window functions in Snowflake

In this post we will show you how to create your own aggregate functions in the Snowflake cloud data warehouse. This type of feature is known as a user-defined aggregate…
Uli Bethke February 4, 2018

Learn Window Functions on Snowflake. Become a cloud data warehouse superhero.

In a recent post we compared Window Function Features by Database Vendors. In this post we will give you an overview of the support for various window function features on…
Uli Bethke January 31, 2018
XML

Converting XML to Hive

In this example we will use the Flexter XML converter to generate a Hive schema and parse an XML file into a Hive database. We will then use the spark-sql…
Maciek January 27, 2018
XML

How to convert XML to Spark Delta Tables and Parquet

The main option for converting XML on Spark to Parquet and Delta Tables is the Spark XML-Library. It is an external library that can be integrated with Spark but does…
Maciek January 25, 2018