Skip to main content

Watch the Video

YouTube

By loading the video, you agree to YouTube's privacy policy.
Learn more

Load video

PGlmcmFtZSB0aXRsZT0iRXh0ZW5kaW5nIEV4aXN0aW5nIERhdGEgVmF1bHQgTW9kZWwgYnkgR0RQUi1JZGVudGlmaWVkIERhdGEiIHdpZHRoPSI1MDAiIGhlaWdodD0iMjgxIiBzcmM9Imh0dHBzOi8vd3d3LnlvdXR1YmUtbm9jb29raWUuY29tL2VtYmVkL0lQMFZCSlhjdDJVP2ZlYXR1cmU9b2VtYmVkIiBmcmFtZWJvcmRlcj0iMCIgYWxsb3c9ImFjY2VsZXJvbWV0ZXI7IGF1dG9wbGF5OyBjbGlwYm9hcmQtd3JpdGU7IGVuY3J5cHRlZC1tZWRpYTsgZ3lyb3Njb3BlOyBwaWN0dXJlLWluLXBpY3R1cmU7IHdlYi1zaGFyZSIgcmVmZXJyZXJwb2xpY3k9InN0cmljdC1vcmlnaW4td2hlbi1jcm9zcy1vcmlnaW4iIGFsbG93ZnVsbHNjcmVlbj48L2lmcmFtZT4=

In our ongoing Data Vault Friday series, our esteemed CEO, Michael Olschimke, tackles a compelling question raised by an engaged member of our audience.

“Let’s assume that DWH is fed from many source systems and one of them (some minor one, called ‘XYZ’) exports customer data identified by PERSONAL_ID (no other identifier available). We already have HUB_CUSTOMER based on some other customer identifier, and the PERSONAL_ID attribute is stored in SAT_CUSTOMER_PD. But there is one important thing regarding customer data, there are cases where multiple rows in HUB_CUSTOMER have the same PERSONAL_ID in mentioned satellite (which means, that some of the customers have been registered multiple times in our core systems).”

In this illuminating episode, Michael delves into the intricate scenario of integrating customer data from diverse sources, emphasizing the challenges posed by the absence of a unique identifier and the existence of duplicate entries. He articulates a strategic approach to address this nuanced issue within the Data Vault framework, providing practical insights and recommendations for achieving a coherent and accurate representation of customer information.

This discussion proves invaluable for data professionals navigating the complexities of consolidating diverse customer data sets with varying identifier structures.

Meet the Speaker

Extending Existing Data Vault Model by GDPR-Identified Data

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Get Updates and Support

Please send inquiries and feature requests to [email protected]

For Data Vault training and on-site training inquiries, please contact [email protected] or register at www.scalefree.com.

To support the creation of Visual Data Vault drawings in Microsoft Visio, a stencil is implemented that can be used to draw Data Vault models. The stencil is available at www.visualdatavault.com.

Scalefree

Leave a Reply