The aim of this project was to transform raw UK airport punctuality datasets into a clean and structured analytical system. Starting with 24 CSV files, I implemented a full ETL process by extracting raw data, cleaning inconsistencies, removing duplicates, standardising formats, and aggregating key metrics such as delays, cancellations, and early/on-time performance.
I designed a data warehouse star schema consisting of five dimension tables (Time, Airline, Airport, Area, Service Ratings) and one central fact table capturing monthly airport punctuality metrics. Oracle SQL was used to build and populate the warehouse, enforcing data integrity through primary and foreign key constraints.
With the warehouse complete, I created four Tableau visualisations exploring airport and airline performance, quarterly trends, geographical distribution of poor ratings, and the relationship between delays and cancellations. These dashboards reveal insights into UK aviation operations across 2023 and 2024.