Author: Shalit, Nadav; Fire, Michael; Ben-Elia, Eran
Title: A Supervised Machine Learning Model For Imputing Missing Boarding Stops In Smart Card Data Cord-id: kkmwww6b Document date: 2020_3_10
ID: kkmwww6b
Snippet: Public transport has become an essential part of urban existence with increased population densities and environmental awareness. Large quantities of data are currently generated, allowing for more robust methods to understand travel behavior by harvesting smart card usage. However, public transport datasets suffer from data integrity problems; boarding stop information may be missing due to imperfect acquirement processes or inadequate reporting. We developed a supervised machine learning metho
Document: Public transport has become an essential part of urban existence with increased population densities and environmental awareness. Large quantities of data are currently generated, allowing for more robust methods to understand travel behavior by harvesting smart card usage. However, public transport datasets suffer from data integrity problems; boarding stop information may be missing due to imperfect acquirement processes or inadequate reporting. We developed a supervised machine learning method to impute missing boarding stops based on ordinal classification using GTFS timetable, smart card, and geospatial datasets. A new metric, Pareto Accuracy, is suggested to evaluate algorithms where classes have an ordinal nature. Results are based on a case study in the city of Beer Sheva, Israel, consisting of one month of smart card data. We show that our proposed method is robust to irregular travelers and significantly outperforms well-known imputation methods without the need to mine any additional datasets. Validation of data from another Israeli city using transfer learning shows the presented model is general and context-free. The implications for transportation planning and travel behavior research are further discussed.
Search related documents:
Co phrase search for related documents- absolute difference and accuracy specificity sensitivity: 1
- absolute difference and logistic regression: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14
- absolute difference and machine learning: 1, 2
- absolute difference and mae absolute error: 1, 2, 3, 4
- absolute error and accuracy metric: 1
- absolute error and accuracy specificity: 1, 2, 3
- absolute error and accuracy specificity sensitivity: 1, 2, 3
- absolute error and accurate method: 1
- absolute error and accurate prediction: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13
- absolute error and additional dataset: 1
- absolute error and address problem: 1, 2
- absolute error and logistic regression: 1, 2, 3, 4
- absolute error and long short term: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29
- absolute error and long short term memory network: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12
- absolute error and machine apply: 1
- absolute error and machine learn: 1
- absolute error and machine learn model: 1
- absolute error and machine learning: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58
- absolute error and mae absolute error: 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64, 65, 66, 67, 68, 69, 70, 71, 72, 73
Co phrase search for related documents, hyperlinks ordered by date