Identifying representative weeks: A clustering analysis of urban dynamics based on public transport data

Publication date

2025-10

Authors

López Pavez, Martín
Soza-Parra, Jaime (ORCID 0000-0003-1530-0439; ISNI 0000000527561235)
Herrera, Juan Carlos

Document Type

Article
Open Access

License

CC BY

Abstract

Transportation is key to understanding urban dynamics, with public transport bus systems shaping mobility patterns, accessibility, and activity, especially in developing countries where this mode accounts for a significant share of trips. Passive data such as GPS and smart card records can reveal these patterns when properly processed. This study proposes a clustering-based methodology to analyze public transport bus data collected over an arbitrarily long period (at least a week) to identify time blocks with similar dynamics without predefined structures. If the dynamics of each time block in a week remain the same over time, a representative week can be constructed, showing the most likely dynamics for each time block in a week. The analysis considers three dimensions: demand (passenger counts), supply (distance traveled by buses), and level of service (bus speeds). Cluster results generate a representative week in terms of mobility indicators and transport operations, enabling analysis and comparison of dynamics across different city zones. Using data from Santiago de Chile's bus system for August 2019 and April 2020, the methodology was applied to 10 city zones. Results highlighted distinct dynamics across zones and the need to incorporate all three dimensions for representative weeks. Regular application of this approach is crucial, as cluster characteristics evolve over time. While promising avenues for future development remain, our methodology provides a flexible, robust data-driven foundation for understanding urban transport dynamics, adaptable to different cities and supporting evidence-based decisions.
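
The idea of clustering time blocks on demand, supply, and level of service, and then keeping the most likely cluster per block, can be illustrated with a minimal sketch. The abstract does not name the clustering algorithm, the block length, or the aggregation rule, so everything specific here is an assumption: hourly blocks, z-score scaling, a plain k-means with deterministic initialisation, and a "most frequent cluster across weeks" rule. The function names (`standardise`, `kmeans`, `representative_week`) and the toy numbers are hypothetical and are not taken from the study.

```python
from collections import Counter, defaultdict
import math
import statistics


def standardise(rows):
    """Z-score each feature column so no dimension dominates the distance."""
    cols = list(zip(*rows))
    means = [statistics.fmean(c) for c in cols]
    # Fall back to 1.0 when a column is constant (population std of 0).
    stds = [statistics.pstdev(c) or 1.0 for c in cols]
    return [tuple((v - m) / s for v, m, s in zip(row, means, stds))
            for row in rows]


def kmeans(points, k, iters=50):
    """Plain Lloyd's k-means; centroids start at the first k distinct points."""
    centroids = []
    for p in points:
        if p not in centroids:
            centroids.append(p)
        if len(centroids) == k:
            break
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign every point to its nearest centroid (Euclidean distance).
        new_labels = [
            min(range(len(centroids)), key=lambda c: math.dist(p, centroids[c]))
            for p in points
        ]
        # Move each centroid to the mean of the points assigned to it.
        for c in range(len(centroids)):
            members = [p for p, lab in zip(points, new_labels) if lab == c]
            if members:
                centroids[c] = tuple(sum(v) / len(members) for v in zip(*members))
        if new_labels == labels:
            break
        labels = new_labels
    return labels


def representative_week(observations, k):
    """Map each (day, hour) block to its modal cluster and that cluster's share.

    Each observation is (week, day_of_week, hour, demand, supply, speed).
    """
    features = standardise([obs[3:] for obs in observations])
    labels = kmeans(features, k)
    by_block = defaultdict(list)
    for obs, label in zip(observations, labels):
        by_block[(obs[1], obs[2])].append(label)
    result = {}
    for block, block_labels in by_block.items():
        label, count = Counter(block_labels).most_common(1)[0]
        result[block] = (label, count / len(block_labels))
    return result


# Illustrative toy data only: (week, day, hour, passengers, bus-km, km/h).
observations = [
    (1, 0, 8, 900, 400, 14),
    (1, 0, 14, 300, 200, 22),
    (2, 0, 8, 950, 420, 13),
    (2, 0, 14, 280, 190, 23),
    (3, 0, 8, 320, 205, 22),   # an atypical morning, e.g. a holiday
    (3, 0, 14, 310, 210, 21),
]

rep = representative_week(observations, k=2)
for block, (label, share) in sorted(rep.items()):
    print(block, "cluster", label, "share", round(share, 2))
```

In this toy case the Monday 08:00 block keeps its peak-like cluster even though one of the three weeks behaves like an off-peak period, and the share (two out of three weeks) gives a rough indication of how stable that block is. A share well below 1 for many blocks would suggest that a single representative week describes the period poorly.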

Keywords

Bus system operational characteristics, Clustering passive data, Representative week, Taverne, Development, Sociology and Political Science, Urban Studies, Tourism, Leisure and Hospitality Management, SDG 11 - Sustainable Cities and Communities

Citation

López Pavez, M, Soza-Parra, J & Herrera, J C 2025, 'Identifying representative weeks: A clustering analysis of urban dynamics based on public transport data', Cities, vol. 165, 106094, pp. 1-13. https://doi.org/10.1016/j.cities.2025.106094