Data Modeling using Star Schema


Data Model.PNG

✍️ The Process


🌐 Data Merging: Creating a Unified Dataset

Our journey started with a crucial step - getting our data ready. 🧐 We needed to not only prepare it for analysis but also make it structured for deeper exploration.

In the first step, I merged the datasets from three years: 2019, 2020, and 2021. 📊 The goal was a single, comprehensive dataset covering the whole period. The result? A unified dataset that paved the way for the rest of the analysis. 🚀
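As a rough illustration of the merging step, here is a minimal sketch using only Python's standard library. The sample data, the column names (`program_code`, `cost`, `revenue`), and the `merge_years` helper are all assumptions made for the example; the real files and schema may differ.

```python
import csv
import io

# Hypothetical stand-ins for the three yearly files.
# The column names are assumptions, not the project's real schema.
YEARLY_CSV = {
    2019: "program_code,cost,revenue\nP001,100.0,40.0\nP002,250.0,90.0\n",
    2020: "program_code,cost,revenue\nP001,110.0,45.0\n",
    2021: "program_code,cost,revenue\nP001,120.0,50.0\nP003,300.0,95.0\n",
}


def merge_years(sources):
    """Stack the yearly datasets into one list of rows, tagging each row with its year."""
    merged = []
    for year, text in sorted(sources.items()):
        for row in csv.DictReader(io.StringIO(text)):
            row["year"] = year  # keep track of which file the row came from
            merged.append(row)
    return merged


unified = merge_years(YEARLY_CSV)
print(f"{len(unified)} rows across years {sorted({r['year'] for r in unified})}")
```

Tagging every row with its year before stacking keeps the source of each record recoverable after the merge.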

🔑 Creating a Key Identifier: Introducing a New Primary Key

Within the Cost and Revenue dataset, a specific need emerged - a unique identifier for each record to streamline our analysis. 🔄 To address it, I created a new primary key by combining the Program code with the corresponding year. 🗝️ The result is what we now call the "Cost and Revenue" table. 📈 This table became the linchpin of our analytical framework, giving the model cohesion and clarity.
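The key construction described above could look something like the sketch below. The underscore separator, the upper-casing and trimming of the program code, and the names `make_program_year_key` and `program_year_key` are assumptions for illustration; the text only states that the Program code and the year are combined.

```python
def make_program_year_key(program_code, year):
    """Build a single-column key such as 'P001_2019'.

    The separator and the normalization (trim + upper-case) are assumptions.
    """
    return f"{str(program_code).strip().upper()}_{int(year)}"


# Hypothetical rows from the merged Cost and Revenue data.
rows = [
    {"program_code": "P001", "year": 2019, "cost": 100.0, "revenue": 40.0},
    {"program_code": "P001", "year": 2020, "cost": 110.0, "revenue": 45.0},
    {"program_code": "p002 ", "year": 2019, "cost": 250.0, "revenue": 90.0},
]

for row in rows:
    row["program_year_key"] = make_program_year_key(row["program_code"], row["year"])

# A primary key must be unique, so check for duplicates before relying on it.
keys = [row["program_year_key"] for row in rows]
duplicates = {key for key in keys if keys.count(key) > 1}
if duplicates:
    raise ValueError(f"Duplicate keys found: {sorted(duplicates)}")

print(keys)
```

The uniqueness check matters because the same program code appears once per year, so only the combination of code and year can identify a row.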

🌍 Enhancing Context: Adding Geographical and Demographic Insights

Our analytical journey demanded geographical context, and I eagerly embraced the task. I incorporated geolocation and demographic data from external CSV files and integrated it into our dataset. 🗺️ I used Python for this step to keep the data accurate, and I applied fuzzy logic to improve the quality and reliability of this essential data layer. 📈
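One plausible reading of the fuzzy step is fuzzy string matching, used to line up place names that are spelled slightly differently across files. The sketch below makes that assumption and uses `difflib` from the standard library; the place names, the `best_match` helper, and the 0.8 cutoff are hypothetical and not taken from the project.

```python
import difflib


def best_match(name, candidates, cutoff=0.8):
    """Return the candidate closest to `name`, or None if nothing is similar enough."""
    # Compare on a normalized form, but return the original spelling.
    lookup = {c.strip().lower(): c for c in candidates}
    hits = difflib.get_close_matches(
        name.strip().lower(), list(lookup), n=1, cutoff=cutoff
    )
    return lookup[hits[0]] if hits else None


# Hypothetical reference names from the geolocation CSV.
reference_names = ["Toronto", "Ottawa", "Hamilton"]

# Hypothetical spellings found in the program data.
for raw in ["Torontto", "otawa ", "Springfield"]:
    print(repr(raw), "->", best_match(raw, reference_names))
```

Returning `None` for weak matches, rather than forcing the nearest name, leaves unmatched rows visible for manual review instead of silently attaching the wrong location.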

🔗 Building Connections: The Core of Our Analysis

To give our dataset coherence, I created relationships between the tables. 🧩 These relationships use composite keys that combine program codes and years. They form the foundation of our data structure, making exploration precise and helping us extract valuable insights throughout our journey. 💡🧠
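Before wiring up relationships like these, it can help to check that the composite key behaves as expected: unique on the lookup side, and present for every row on the fact side. The sketch below is one way to do that in plain Python. The table contents and column names are hypothetical. Note that Power BI relationships join on a single column, so in the model itself the code and year would typically be combined into one key column.

```python
def composite_key(row):
    """Composite key made of program code and year (column names are assumptions)."""
    return (row["program_code"], row["year"])


# Hypothetical fact rows (cost and revenue) and lookup rows (geography).
fact_rows = [
    {"program_code": "P001", "year": 2019, "cost": 100.0},
    {"program_code": "P001", "year": 2020, "cost": 110.0},
    {"program_code": "P009", "year": 2021, "cost": 75.0},
]
dim_rows = [
    {"program_code": "P001", "year": 2019, "region": "North"},
    {"program_code": "P001", "year": 2020, "region": "North"},
]

# The lookup side of the relationship must have exactly one row per key.
dim_index = {composite_key(row): row for row in dim_rows}
if len(dim_index) != len(dim_rows):
    raise ValueError("Composite key is not unique in the lookup table")

# Fact rows whose key has no match would end up under a blank member in the model.
orphans = [row for row in fact_rows if composite_key(row) not in dim_index]

# Join each matching fact row to its lookup row.
joined = [
    {**dim_index[composite_key(row)], **row}
    for row in fact_rows
    if composite_key(row) in dim_index
]

print(f"{len(joined)} joined rows, {len(orphans)} orphan rows")
```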

📊 The Data Model


The Power BI dashboard, including the data model, is available at the link below.

Power BI Dashboard for Blue Box Program →

https://drive.google.com/file/d/1lXTi5tFgKaC3_oEC6Pd0ipfOIBmcqbT0/view?usp=drive_web