Estimating traffic volume for local streets with imbalanced data
Date:
Abstract: Annual average daily traffic (AADT) is an important measurement used in traffic engineering. Local streets are major components of a road network. However, automatic traffic recorders (ATRs) used to collect AADT are often limited to arterial roads, and such information is, therefore, often unavailable for local streets. Estimating AADT on local streets becomes a necessity as local street traffic continues to grow and the capacity of arterial roads becomes insufficient. A challenge is that an under-represented sample of local street AADT may result in biased estimation. A synthetic minority oversampling technique (SMOTE) is applied to oversample local streets to correct the imbalanced sampling among different road types. A generalized linear mixed model (GLMM) is employed to estimate AADT incorporating various independent variables, including factors of roadway design, socio-demographics, and land use. The model is examined with an AADT dataset from Seattle, WA. Results show that: (1) SMOTE helps to correct imbalanced sampling proportions and improve model performance significantly; (2) the number of lanes and the number of crosswalks are both positively associated with AADT; (3) road segments located in areas with a higher population density or more mixed land use have a higher AADT; (4) distance to the nearest arterial road is negatively correlated with AADT; and (5) AADT creates spatial spillover effects on neighboring road segments. The combination of SMOTE and GLMM improves the estimation accuracy on AADT, which contributes to better data for transportation planning and traffic monitoring, and to cost saving on data collection.