Used for route choice modelling by the transportation research community, recursive logit is a form of inverse reinforcement learning. By solving a large-scale system of linear equations recursive logit allows estimation of an optimal (negative) reward function in a computationally efficient way that performs for large networks and a large number of observations. In this paper we review examples of recursive logit and inverse reinforcement learning models applied to real world GPS travel trajectories and explore some of the challenges in modeling bicycle route choice in the city of Amsterdam using recursive logit as compared to a simple baseline multinomial logit model with environmental variables. We discuss conceptual, computational, numerical and statistical issues that we encountered and conclude with recommendation for further research.

, , , ,
doi.org/10.1016/j.procs.2021.03.062
Procedia Computer Science
12th International Conference on Ambient Systems, Networks and Technologies, ANT 2021 / 4th International Conference on Emerging Data and Industry 4.0, EDI40 2021 / Affiliated Workshops
Centrum Wiskunde & Informatica, Amsterdam, The Netherlands

Koch, T, & Dugundji, E.R. (2021). Limitations of recursive logit for inverse reinforcement learning of bicycle route choice behavior in Amsterdam. In Procedia Computer Science (Vol. 184, pp. 492–499). doi:10.1016/j.procs.2021.03.062