Administrative Boundaries USA

Overview: The Administrative Boundaries USA dataset contains border information, as polygons, for the following divisions of the fifty states, the District of Columbia, Puerto Rico, and the Island areas (American…
Uli Bethke December 20, 2021

Data Warehouse 3.0. A Reference Architecture for the Modern Data Warehouse.

Reference data architecture for data management and analytics. An introduction. I frequently come across data architecture diagrams that are riddled with vendor names, tools, and technologies. Tools and technologies have…
Uli Bethke December 9, 2021

Converting CRS XML to AWS Athena

AWS Athena does not have native support for XML. Processing XML files on AWS Athena is slow, manual, and error-prone. One option is to use AWS Glue and convert…
Uli Bethke October 29, 2021

SQL Parser deep dive. Use cases, features, practical examples, and tools for SQL parsing

In this blog post, we will cover all aspects of SQL parsing. First, we will explain what we mean by SQL parsing and look at the inner workings and mechanics…
Uli Bethke October 22, 2021

Converting XML and JSON to a Data Lake

Data lakes are a popular design pattern in data analytics. A data lake is used to store a copy of data coming from operational source systems such as relational databases.…
Uli Bethke September 30, 2021

Converting SDMX XML to BigQuery

If your data is in Avro, JSON, Parquet, etc., you can load it easily into BigQuery. Got XML for BigQuery? BigQuery does not provide any native support to deal…
Uli Bethke June 22, 2021