Multimodal representation learning for place recognition using deep Hebbian predictive coding

Pearson, Martin; Dora, Shirin; Struckmeier, Oliver; Knowles, Thomas; Mitchinson, Ben; Tiwari, Kshitij; Kyrki, Ville; Bohte, Sander; Pennartz, Cyriel

doi:10.3389/frobt.2021.732023

M.J. Pearson (Martin), S. Dora (Shirin), O. Struckmeier (Oliver), T.C. Knowles (Thomas), B. Mitchinson (Ben), K. Tiwari (Kshitij), V. Kyrki (Ville), S.M. Bohte (Sander) and C. Pennartz (Cyriel)

2021-12-13

Multimodal representation learning for place recognition using deep Hebbian predictive coding

Frontiers in Robotics and AI , Volume 8

Recognising familiar places is a competence required in many engineering applications that interact with the real world such as robot navigation. Combining information from different sensory sources promotes robustness and accuracy of place recognition. However, mismatch in data registration, dimensionality, and timing between modalities remain challenging problems in multisensory place recognition. Spurious data generated by sensor drop-out in multisensory environments is particularly problematic and often resolved through adhoc and brittle solutions. An effective approach to these problems is demonstrated by animals as they gracefully move through the world. Therefore, we take a neuro-ethological approach by adopting self-supervised representation learning based on a neuroscientific model of visual cortex known as predictive coding. We demonstrate how this parsimonious network algorithm which is trained using a local learning rule can be extended to combine visual and tactile sensory cues from a biomimetic robot as it naturally explores a visually aliased environment. The place recognition performance obtained using joint latent representations generated by the network is significantly better than contemporary representation learning techniques. Further, we see evidence of improved robustness at place recognition in face of unimodal sensor drop-out. The proposed multimodal deep predictive coding algorithm presented is also linearly extensible to accommodate more than two sensory modalities, thereby providing an intriguing example of the value of neuro-biologically plausible representation learning for multimodal navigation.

Additional Metadata
Persistent URL	doi.org/10.3389/frobt.2021.732023
Journal	Frontiers in Robotics and AI
Project	Human Brain Project - SGA3
Grant	This work was funded by the European Commission 7th Framework Programme; grant id h2020/945539 - Human Brain Project - SGA3 (HBP-SGA3)
Organisation	Machine Learning
Citation APA Style AAA Style APA Style Cell Style Chicago Style Harvard Style IEEE Style MLA Style Nature Style Vancouver Style American-Institute-of-Physics Style Council-of-Science-Editors Style BibTex Format Endnote Format RIS Format CSL Format DOIs only Format	Pearson, M., Dora, S., Struckmeier, O., Knowles, T., Mitchinson, B., Tiwari, K., … Pennartz, C. (2021). Multimodal representation learning for place recognition using deep Hebbian predictive coding. Frontiers in Robotics and AI, 8. doi:10.3389/frobt.2021.732023

View at Publisher

Free Full Text ( Final Version , 6mb )

Multimodal representation learning for place recognition using deep Hebbian predictive coding

Publication

Publication

Address

Publishing at CWI

Questions or comments?

Multimodal representation learning for place recognition using deep Hebbian predictive coding

Publication

Publication

Workflow

Workflow

Add Content