Abstract
We applied four machine learning models to predict consumer demand for bike sharing in Seoul: linear regression, k-nearest neighbors (KNN), random forest, and support vector machine. We aimed to advance previous research on bike sharing demand by incorporating features other than weather, such as air pollution, traffic information, Covid-19 cases, and socioeconomic factors, to increase prediction accuracy. The data were retrieved from the Seoul Public Data Park website and cover the counts of public bike rentals in Seoul, Korea, from January 1 to December 31, 2020. We found that the two best-performing models are the random forest and the support vector machine. Among the 29 features in six categories, those in the weather, pollution, and Covid-19 outbreak categories are the most important for model prediction. Although almost all socioeconomic features rank among the least important, we found that they still help improve the performance of the models.