
This project presents a production-ready data engineering pipeline for analyzing bicycle safety in France, particularly in Île-de-France and Paris, using real-world datasets.
By setting up this data engineering and analysis pipeline, you will get:
For those looking to reuse this dataset, the project addresses and corrects several data quality issues, including:
Inconsistent File Naming: some source files are misspelled, e.g., carcteristiques-2021.csv (for "caracteristiques")
Join Complexity: Requires choosing a granularity (e.g., one row per accident vs. per user)
Data Quality Fixes:
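As a rough illustration of the file-naming and join-granularity points above, here is a minimal sketch using only the Python standard library. It is not the project's actual code. The `FILENAME_FIXES` mapping, the `canonical_name` helper, and the field names `accident_id`, `year`, and `injury` are all hypothetical and do not come from the dataset's schema. Only the misspelled file name is taken from the text.

```python
from collections import defaultdict

# Hypothetical mapping from misspelled source names to corrected names.
# Only the key below appears in the text; the corrected name is an assumption.
FILENAME_FIXES = {
    "carcteristiques-2021.csv": "caracteristiques-2021.csv",
}


def canonical_name(filename: str) -> str:
    """Return the corrected file name, or the input unchanged if no fix is known."""
    return FILENAME_FIXES.get(filename, filename)


# Hypothetical sample rows; field names are illustrative only.
accidents = [
    {"accident_id": 1, "year": 2021},
    {"accident_id": 2, "year": 2021},
]
users = [
    {"accident_id": 1, "injury": "minor"},
    {"accident_id": 1, "injury": "severe"},
    {"accident_id": 2, "injury": "none"},
]

accident_by_id = {a["accident_id"]: a for a in accidents}

# Granularity A: one row per user; accident attributes repeat for each user.
per_user = [{**accident_by_id[u["accident_id"]], **u} for u in users]

# Granularity B: one row per accident; user rows are aggregated first.
users_by_accident = defaultdict(list)
for u in users:
    users_by_accident[u["accident_id"]].append(u)

per_accident = [
    {**a, "n_users": len(users_by_accident[a["accident_id"]])}
    for a in accidents
]
```

The per-user table suits questions about individual injuries, while the per-accident table avoids double counting when tallying accidents. Picking one up front keeps later aggregations consistent.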