
"Join me as we embark on an eco-conscious journey into the heart of data analysis, where sustainability meets insight.
In Canada, the renowned Blue Box Program takes center stage in waste management and recycling. It's a nationwide effort, run by diverse municipalities and organizations with a shared vision of promoting recycling and responsible material disposal in our homes and communities. The program equips residents with blue boxes or bins for recyclables such as paper, cardboard, glass, plastic, and metal. These materials are separated from the regular trash stream and sent on a path of renewal through recycling.
Operated by a nationwide waste management organization, this program spans more than 70 locations across 9 provinces. With a whopping 5.4 million active participants, its commitment to boosting recycling is hard to miss. Year after year, it welcomes a 5% rise in new contributors and successfully recycles an impressive 740,000 tons of waste materials on average.
At the heart of this project, you'll find a deep-seated passion for the environment and an unwavering dedication to effective waste management.
My mission? To craft an end-to-end framework for data analysis, guiding us through data cleaning, preparation, feature analysis, and regression. It's a roadmap for uncovering the insights hidden in the data, one step at a time.
Together, let's dive in and unlock the potential of this data-driven exploration!"
Data Modeling using Star Schema

In this journey, we're exploring a vast dataset that holds the secrets of waste management and recyclable materials across our nation.
Our mission is clear: create a comprehensive data analysis framework from start to finish, covering data cleaning, preparation, feature analysis, and regression modeling.
It all begins here with data manipulation. Using Power Query, we cleaned and merged the datasets from 2019, 2020, and 2021 so that the data was ready for analysis. We then built a Star Schema data model by defining primary and foreign keys within the datasets.
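To make the modeling step concrete, here is a minimal sketch of the same idea in plain Python. It is not the Power Query workflow used in the project, only an analogue: the yearly extracts are appended into one table, each dimension gets a surrogate primary key, and the fact rows carry those keys as foreign keys. The column names (`municipality`, `material`, `tonnes`) and the sample values are assumptions made up for illustration, since the post does not list the real fields.

```python
# Hypothetical yearly extracts; the real column names and values are not given in the post.
yearly_rows = {
    2019: [{"municipality": "Toronto", "material": "Paper", "tonnes": 120.0}],
    2020: [
        {"municipality": "Toronto", "material": "Paper", "tonnes": 130.0},
        {"municipality": "Ottawa", "material": "Glass", "tonnes": 40.0},
    ],
    2021: [{"municipality": "Ottawa", "material": "Paper", "tonnes": 55.0}],
}


def build_dimension(values):
    """Map each distinct value to a surrogate primary key (1, 2, 3, ...)."""
    return {value: key for key, value in enumerate(sorted(set(values)), start=1)}


def build_star_schema(rows_by_year):
    """Return (dim_municipality, dim_material, fact) from per-year row lists."""
    # Append the yearly extracts into one table, tagging each row with its year.
    merged = [
        dict(row, year=year)
        for year, rows in sorted(rows_by_year.items())
        for row in rows
    ]

    # Dimension tables: one entry per distinct value, keyed by a surrogate id.
    dim_municipality = build_dimension(row["municipality"] for row in merged)
    dim_material = build_dimension(row["material"] for row in merged)

    # Fact table: measures plus foreign keys pointing at the dimensions.
    fact = [
        {
            "year": row["year"],
            "municipality_id": dim_municipality[row["municipality"]],
            "material_id": dim_material[row["material"]],
            "tonnes": row["tonnes"],
        }
        for row in merged
    ]
    return dim_municipality, dim_material, fact


dim_municipality, dim_material, fact = build_star_schema(yearly_rows)
for fact_row in fact:
    print(fact_row)
```

Keeping the descriptive text in the dimensions and only keys and measures in the fact table is what gives the model its star shape: each dimension joins to the fact table through a single key.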
To enrich our analysis, we incorporated geolocation and demographic data from CSV files, adding a valuable layer of context. Python played a role in data refinement and record matching, using fuzzy matching to keep the matches accurate.
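As a rough illustration of the matching step, the sketch below pairs inconsistently spelled place names with a reference list using approximate string matching. The post does not say which library or threshold was used, so this uses `difflib.get_close_matches` from the Python standard library as a stand-in; the helper names, the sample names, and the `0.8` cutoff are all assumptions.

```python
from difflib import get_close_matches


def normalize(name):
    """Lowercase, drop periods, and collapse runs of whitespace."""
    return " ".join(name.lower().replace(".", "").split())


def match_names(source_names, reference_names, cutoff=0.8):
    """Map each source name to its closest reference name, or None if no match clears the cutoff."""
    # Compare normalized forms, but return the original reference spelling.
    lookup = {normalize(ref): ref for ref in reference_names}
    matches = {}
    for name in source_names:
        candidates = get_close_matches(normalize(name), list(lookup), n=1, cutoff=cutoff)
        matches[name] = lookup[candidates[0]] if candidates else None
    return matches


# Hypothetical names: a clean reference list and a messier source column.
reference = ["Toronto", "Ottawa", "Thunder Bay"]
source = ["Tornto", "OTTAWA ", "Thunder  Bay", "Vancouver"]

matches = match_names(source, reference)
for original, matched in matches.items():
    print(repr(original), "->", matched)
```

Names that fall below the cutoff come back as `None` rather than being forced onto the nearest candidate, so they can be reviewed by hand. A higher cutoff trades missed matches for fewer false ones.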