Converting Google Analytics JSON to S3 on AWS

Uli Bethke Flexter, JSON

In this guide we will show you how to process Google Analytics JSON files with Enterprise Flexter and convert it to Amazon AWS S3. Google Analytics Google Analytics is a freemium web analytics service offered by Google that tracks and reports website traffic.Google launched the service in November 2005 after acquiring Urchin. Google Analytics is now the most widely used ...

Loading data into Snowflake and performance of large joins

Dorian Beganovic Snowflake

Introduction In this blog post we will load a large dataset into Snowflake and then evaluate the performance of joins in Snowflake. Loading large data into Snowflake Dataset The dataset we will load is hosted on Kaggle and contains Checkouts of Seattle library from 2006 until 2017. You can also download the data and see some samples here. The dataset ...

Converting FHIR JSON to CSV with Flexter

Uli Bethke CSV, Flexter, JSON

In this post we will be converting FHIR JSON files to text (CSV). FHIR Fast Healthcare Interoperability Resources (FHIR, pronounced "fire") is a draft standard describing data formats and elements (known as "resources") and an application programming interface (API) for exchanging electronic health records. The standard was created by the Health Level Seven International (HL7) health-care standards organization. FHIR builds ...

Deep dive on caching in Snowflake

Dorian Beganovic Snowflake

In this post we will explain the clever caching strategies Snowflake uses for performance optimization. In the process we will also cover related internals of Snowflake. A lot of information is from the official research paper created by the Snowflake authors which explains the architecture of Snowflake in depth. Caching in virtual warehouses Snowflake strictly separates the storage layer from ...

Big Data, ETL, Data Warehouse Intern

Uli Bethke Uncategorized

Big Data, ETL, Data Warehouse Intern Interested in Big Data, Data Warehouses, ETL, Business Intelligence, Data Analytics? Sonra Intelligence is looking for a computer science intern to join our team based in our office (Dublin 7 Grangegorman DIT campus) or remotely. Duration Starting asap The position is either full time or part time. Working remotely is an option. 3-6 months ...