Skip to main content
search
0

Masking Business Keys from Hubs for Privacy in Data Vault

Watch the Video

In our continuous Data Vault Friday series, our experienced BI Consultant, Lorenz Kindling, delves into a pertinent question posed by an engaged member of our audience.

“How to mask business keys from Hubs in a GDPR-compliant way?”

Lorenz, with his wealth of expertise, provides insightful guidance on the crucial matter of masking business keys while ensuring compliance with the rigorous regulations outlined by GDPR. With data privacy and security at the forefront, he explores effective techniques and strategies to safeguard business keys within the Hub entities, striking the delicate balance between usability and GDPR adherence.

This informative discussion is a valuable resource for data professionals navigating the complexities of GDPR compliance within the Data Vault framework, offering practical solutions and best practices.

Capturing Temporal Data on Changing Relationships in Data Vault

Watch the Video

In the latest installment of our enlightening Data Vault Friday series, our CEO, Michael Olschimke, delves into a thought-provoking query posed by a member of our engaged audience.

“We have received the relationship between investor and company with a PostingMonth for the last couple of months. Also, the ownership percentage for the relationship could change over time (see attached Excel for mock data :)). So our question is: should we take the Period as a part of the Investor_Company_Link? If yes, how can we track the relationship changes with Effectivity Satellite? Or do you think Multi-active link satellite is a better choice here?”

Michael meticulously explores the intricacies of modeling investor-company relationships, particularly when faced with dynamic factors such as changing ownership percentages over distinct time periods. He offers valuable insights into the considerations between incorporating Period as part of the Investor_Company_Link and the nuanced application of Effectivity Satellite or Multi-active link satellite to accurately capture and manage the evolving nature of these relationships.

This insightful discussion proves instrumental for data professionals navigating the complexities of representing dynamic relationships within the Data Vault framework.

Meet the Speaker

Profile picture of Michael Olschimke

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Extending Existing Data Vault Model by GDPR-Identified Data

Watch the Video

In our ongoing Data Vault Friday series, our esteemed CEO, Michael Olschimke, tackles a compelling question raised by an engaged member of our audience.

“Let’s assume that DWH is fed from many source systems and one of them (some minor one, called ‘XYZ’) exports customer data identified by PERSONAL_ID (no other identifier available). We already have HUB_CUSTOMER based on some other customer identifier, and the PERSONAL_ID attribute is stored in SAT_CUSTOMER_PD. But there is one important thing regarding customer data, there are cases where multiple rows in HUB_CUSTOMER have the same PERSONAL_ID in mentioned satellite (which means, that some of the customers have been registered multiple times in our core systems).”

In this illuminating episode, Michael delves into the intricate scenario of integrating customer data from diverse sources, emphasizing the challenges posed by the absence of a unique identifier and the existence of duplicate entries. He articulates a strategic approach to address this nuanced issue within the Data Vault framework, providing practical insights and recommendations for achieving a coherent and accurate representation of customer information.

This discussion proves invaluable for data professionals navigating the complexities of consolidating diverse customer data sets with varying identifier structures.

Meet the Speaker

Profile picture of Michael Olschimke

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Processing CDC Data in Data Vault

Watch the Video

As part of our ongoing Data Vault Friday series, our knowledgeable BI Consultant, Julian Brunner, delves into a question presented by an audience member.

“One of our sources delivers CDC data. Are there any DV standards/best practices on how to model and process this kind of data?”

In this insightful episode, Julian addresses the specific challenges associated with Change Data Capture (CDC) data within the realm of Data Vault methodology. He provides a comprehensive overview of the best practices and standards that should be considered when modeling and processing CDC data in a Data Vault environment.

Julian’s expertise shines through as he navigates the intricacies of incorporating CDC data seamlessly into the Data Vault model. Viewers gain valuable insights into the recommended approaches, key considerations, and potential pitfalls to be aware of when dealing with CDC data sources.

By the end of the discussion, Julian equips the audience with practical knowledge, empowering them to effectively integrate CDC data into their Data Vault implementations.

Extending Satellites in Data Vault

Watch the Video

In our continuous Data Vault Friday series, our skilled trainer, Marc Finger, delves into a question posed by an audience member.

“Changes in the source system (new column/s): New row in an existing Satellite or new Satellite?”

Marc provides valuable insights into handling changes in the source system, specifically when encountering the addition of new columns. The question revolves around whether it’s more appropriate to introduce a new row in an existing Satellite or create an entirely new Satellite to accommodate these changes.

Through a clear and concise discussion, Marc elucidates the considerations and factors that influence the decision-making process. He explores the implications of both options, emphasizing the importance of aligning the chosen approach with the specific requirements and goals of the Data Vault model.

The trainer guides the audience through the thought process involved in making this decision, providing practical tips and best practices. By the end of the episode, viewers gain a deeper understanding of how to navigate the challenges associated with changes in the source system within the context of Data Vault methodology.

Meet the Speaker

Marc Winkelmann

Marc Finger

Marc is working in Business Intelligence and Enterprise Data Warehousing (EDW) with a focus on Data Vault 2.0 implementation and coaching. Since 2016 he is active in consulting and implementation of Data Vault 2.0 solutions with industry leaders in manufacturing, energy supply and facility management sector. In 2020 he became a Data Vault 2.0 Instructor for Scalefree.

Metadata Translation in Data Vault

Watch the Video

In our ongoing Data Vault Friday series, our CEO Michael Olschimke discusses a question from the audience.

“Our EDW should use English entity names for hubs, links, and satellites. However, our sources are in a variety of languages (English, and German mostly). Where is the best option to translate everything into English?”

Michael provides insightful guidance on tackling the challenge of maintaining consistency in entity names across a multilingual landscape. He explores different strategies for translating entity names, weighing the pros and cons of various approaches. Whether to perform the translation at the source level, during the ETL (Extract, Transform, Load) process, or within the EDW itself, Michael offers considerations to help make an informed decision based on the specific needs and characteristics of the project.

The CEO emphasizes the importance of aligning with business objectives and ensuring that the chosen translation strategy aligns with the overall goals of the data warehousing initiative. This episode provides valuable insights and best practices for handling multilingual challenges in Data Vault projects, contributing to the success of your data integration and management endeavors.

Hiding Dimension Members in Data Vault

Watch the Video

In our ongoing Data Vault Friday series, our CEO Michael Olschimke addresses a query from the audience, exploring the dynamics of managing data visibility in the DIMENSION information mart.

“How can a record be hidden in the DIMENSION information mart if it is no longer in use? Our Data Warehouse (DWH) features a hierarchy of region, division, and zone, which may undergo splitting or merging multiple times. The challenge is that the deleted event is not signaled from the source side, and only a full refresh captures new hierarchy information. Users desire a consistently current status reflected in both FACT and DIM tables.

1. To handle this, the current relation can be flagged and counted. This approach involves managing the relationship with a counter, allowing for effective tracking and visibility.

2. Additionally, the last relation needs to remain visible in the FACT table, ensuring that historical relationships are retained for reference.”

In this engaging video, Michael elaborates on these strategies, providing insights into maintaining data integrity and visibility within complex hierarchies, while accommodating changes and updates efficiently.

Meet the Speaker

Profile picture of Michael Olschimke

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Sampling (DB Subsetting) Production Data in Data Vault

Watch the Video

In our ongoing Data Vault Friday series, our CEO Michael Olschimke engages with a pertinent question from the audience, shedding light on best practices for structuring EDW environments.

“In one of the previous webinars (‘EDW Environments’), you mentioned about best practices for creating your EDW environments. Let’s consider a configuration where we have 4 environments, DEV + TST and PRE_PROD + PROD. Moreover, assume that the PROD environment is very heavy in the meaning of data volumes and we simply cannot handle such amounts of data on PRE PROD and TST (data on TST env. will be anonymized). Do you have any advice on how to create lightweight environments from PROD?”

In this insightful video, Michael delves into the complexities of managing EDW environments with varying data volumes. He offers practical advice on creating lightweight versions of the production environment for development, testing, and pre-production stages. The discussion encompasses strategies for data anonymization on the testing environment and optimizing resources to ensure efficiency across different stages of the EDW lifecycle.

Meet the Speaker

Profile picture of Michael Olschimke

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Reference Tables With Effectivity Satellites in Data Vault

Watch the Video

In our continuous exploration of Data Vault concepts in the Data Vault Friday series, our CEO Michael Olschimke delves into an intriguing question posed by the audience.

“Do you use Effectivity Satellites also for Reference Data in Reference Satellites?”

This concise yet crucial inquiry prompts Michael to unravel the considerations and best practices associated with leveraging Effectivity Satellites in the context of Reference Data within Reference Satellites.

In this insightful video, Michael shares his expertise, discussing the potential applications and benefits of employing Effectivity Satellites for managing reference data. He sheds light on how this approach can enhance the flexibility and temporal aspects of Reference Satellites, contributing to a more robust and adaptable Data Vault architecture.

Meet the Speaker

Profile picture of Michael Olschimke

Michael Olschimke

Michael has more than 15 years of experience in Information Technology. During the last eight years he has specialized in Business Intelligence topics such as OLAP, Dimensional Modelling, and Data Mining. Challenge him with your questions!

Stärken der Datenanalyse Innerhalb von Salesforce

Watch the Webinar

In diesem Webinar erfahren Sie, wie Sie Salesforce-Reports erstellen und nutzen können, um aussagekräftige Einblicke in Ihre KPI zu gewinnen.

Wir zeigen Ihnen bewährte Best Practices für die Erstellung von benutzerdefinierten Berichten und Dashboards sowie praktische Tipps für die effektive Nutzung von Salesforce-Reporting-Tools.

Salesforce unterscheidet sich unter allen anderen CRM-Systemen, da das Reporting auf allen Daten innerhalb des Systems möglich ist. Sie lernen, wie Sie Reports filtern, gruppieren und Diagramme erstellen können.

Dieses Webinar richtet sich sowohl an Anfänger als auch an fortgeschrittene Salesforce-Nutzer, die ihre Reporting-Fähigkeiten verbessern möchten.

Watch Webinar Recording

Webinar Agenda

1. Reports In Salesforce
2. Vorteile
3. Warum Nutzen
4. Demos
5. Fazit

Close Menu