This dataset collects the daily requests sent to the Navitia route planner during 2022 and the first quarter of 2023 starting or ending in the Île-de-France region. The responses included in the dataset are not real journeys but routes proposed by the planner. Every route includes the start and end points as well as some transit points.
Since this information could jeopardize the privacy of the users, an anonymization procedure was applied. The original dataset was anonymized using the anonymizer module developed within the Mobidatalab project. The module includes several anonymization methods, privacy-preserving analysis method, and methods to compute different utility and privacy metrics. It also provides a command line interface (CLI) that allows users to use all the module’s functionalities in a straightforward way. The module is also ready to be deployed in a server and to process requests through an API.
The dataset was anonymized using the “Time partition Microaggregation” method, a version of the well-known microaggregation method for very large mobility datasets where the application of microaggregation would not be feasible. A detailed description of the method can be found here.
Specific parameters:
• K: 10
• Interval: 3600 seconds
• Clustering_method: MDAV
• Agregation method: Mean trajectory
Along with the daily datasets, origin-destination matrixes are also included. These were computed using a tessellation corresponding to the French zip codes. A geojson file with the layout of the postal codes of the region is also included.