Generalized model for mapping bicycle ridership with crowdsourced data

dc.contributor.authorNelson, Trisalyn
dc.contributor.authorRoy, Avipsa
dc.contributor.authorFerster, Colin
dc.contributor.authorFischer, Jaimy
dc.contributor.authorBrum-Bastos, Vanessa
dc.contributor.authorLaberee, Karen
dc.contributor.authorWinters, Meghan
dc.date.accessioned2021-03-04T20:26:35Z
dc.date.available2021-03-04T20:26:35Z
dc.date.copyright2021en_US
dc.date.issued2021
dc.description.abstractFitness apps, such as Strava, are a growing source of data for mapping bicycling ridership, due to large samples and high resolution. To overcome bias introduced by data generated from only fitness app users, researchers build statistical models that predict total bicycling by integrating Strava data with official counts and geographic data. However, studies conducted on single cities provide limited insight on best practices for modeling bicycling with Strava as generalizability is difficult to assess. Our goal is to develop a generalized approach to modeling bicycling ridership using Strava data. In doing so we enable detailed mapping that is more inclusive of all bicyclists and will support more equitable decision-making across cities. We used Strava data, official counts, and geographic data to model Average Annual Daily Bicycling (AADB) in five cities: Boulder, Ottawa, Phoenix, San Francisco, and Victoria. Using a machine learning approach, LASSO, we identify variables important for predicting ridership in all cities, and independently in each city. Using the LASSO-selected variables as predictors in Poisson regression, we built generalized and city-specific models and compared accuracy. Our results indicate generalized prediction of bicycling ridership on a road segment in concert with Strava data should include the following variables: number of Strava riders, percentage of Strava trips categorized as commuting, bicycling safety, and income. Inclusion of city-specific variables increased model performance, as the R2 for generalized and city-specific models ranged from 0.08–0.80 and 0.68–0.92, respectively. However, model accuracy was influenced most by the official count data used for model training. For best results, official count data should capture diverse street conditions, including low ridership areas. Counts collected continuously over a long time period, rather than at peak periods, may also improve modeling. Modeling bicycling from Strava and geographic data enables mapping of bicycling ridership that is more inclusive of all bicyclists and better able to support decision-making.en_US
dc.description.reviewstatusRevieweden_US
dc.description.scholarlevelFacultyen_US
dc.description.sponsorshipThis work was supported by a grant (#1516-HQ-000064) from the Public Health Agency of Canada. MW is supported by a Scholar Award from the Michael Smith Foundation for Health Research.en_US
dc.identifier.citationNelson, T., Roy, A., Ferster, C., Fischer, J., Brum-Bastos, V., Laveree, K., … Winters, M. (2021). Generalized model for mapping bicycle ridership with crowdsourced data. Transportation Research Part C: Emerging Technologies, 125, 1-15. https://doi.org/10.1016/j.trc.2021.102981.en_US
dc.identifier.urihttps://doi.org/10.1016/j.trc.2021.102981
dc.identifier.urihttp://hdl.handle.net/1828/12752
dc.language.isoenen_US
dc.publisherTransportation Research Part C: Emerging Technologiesen_US
dc.subjectBias-correction
dc.subjectLASSO
dc.subjectBig data
dc.subjectBicycling ridership
dc.subjectExposure
dc.subjectStrava
dc.subject.departmentDepartment of Geography
dc.titleGeneralized model for mapping bicycle ridership with crowdsourced dataen_US
dc.typeArticleen_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Nelson_Trisalyn_TranspResPartCEmergTechnol_2021.pdf
Size:
5.15 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
2 KB
Format:
Item-specific license agreed upon to submission
Description: