
This project presents a production-ready data engineering pipeline for analyzing bicycle safety in France, particularly in Île-de-France and Paris, using real-world datasets.
By setting up this data engineering and analysis pipeline, you will get:
For those looking to reuse this dataset, the project addresses and corrects several data quality issues, including:
Inconsistent File Naming: e.g., carcteristiques-2021.csv contains spelling errors
Join Complexity: Requires choosing a granularity (e.g., one row per accident vs. per user)
Data Quality Fixes:
Métadonnées :
204K
557K
79
45
Il n'y a pas encore d'API associées