Data Vault vs. Data Mesh?

Should I still do Data Vault if there is Data Mesh? In the past few weeks and months, I got these very interesting questions which brought up a several times. So, I try to explain here my understanding and my thoughts. If you have a diverging view, feel free to contact me on LinkedIn for […]

Do Equi-Joins always matter?

It happens that from time to time I comeacross some statements about how databases work and how they shall be queried. And I like to read those recommendations. Especially if they come with a theoretical explanation. But even more, I like to read those comments if they come with tests that prove what the theory […]

Third Normal Form (3NF) and the Data Vault OR How not to be afraid of the Data Vault

From time to time we receive an interesting question from people interested in using the Datavault Builder: Does the Datavault Builder DWH automation tool supports 3NF? The interesting answer is: yes. Though we have actively decided against a 3NF core because we believe that splitting the data into individual functions such as keys (hubs), relationships […]

Temporality in the Data Warehouse Part 4 of 4: SCD Type 2 Dimensions (and Facts)

Kimball Style dimensions – SCD Type 2 Output If you haven’t read them I recommend reading the first 3 parts first. Getting Started After writing a nine-page long story about the topic including history, modeling, and performance consideration I came to the point where I understood that it is too much to […]

Temporality in the Data Warehouse Part 3 of 4: Give me the damn times

Although in many cases it is not necessary to output the timelines in the reports, there are some cases where the output of timelines is important. Examples are: Displaying a customer history (1d) Highlighting subsequently modified data to identify fraud and legitimate corrections (2d) Flexible comparison of different time periods to understand relative changes (3d) […]

Temporality in the Data Warehouse Part 1 of 4: Definition of the challenge

In the past years, I was confronted with the demand to create a reporting with SCD type 2 dimensions. But since there are usually 3 or more time lines in the data warehouse, I had to clarify again and again what the business requirement actually is. Here I try to sort my thoughts and give […]