Improved Visual Relocalization by Discovering Anchor Points

Soham Saha, Girish Varma, C V Jawahar

Spotlight Presentation
British Machine Vision Conference

July, 2018


Bibtex

@inproceedings{SVJ18,
  author    = {Soham Saha and Girish Varma and C. V. Jawahar},
  title     = {Improved Visual Relocalization by Discovering Anchor Points},
  booktitle = {British Machine Vision Conference 2018, {BMVC} 2018, Northumbria University, Newcastle, UK, September 3-6, 2018},
  pages     = {164},
  year      = {2018},
  url       = {http://bmvc2018.org/contents/papers/0962.pdf}
}

Abstract

We address the visual relocalization problem of predicting the camera location and orientation (6-DoF pose) for a given input scene. We propose a method based on how humans determine their location using visible landmarks. We define anchor points uniformly across the route map and propose a deep learning architecture that predicts the most relevant anchor point present in the scene, as well as the relative offsets with respect to it. The relevant anchor point need not be the anchor point nearest to the ground-truth location, since the nearest one might not be visible from the current pose. Hence we propose a multi-task loss function that discovers the relevant anchor point without needing ground truth for it. We validate our approach on Cambridge Landmarks (large-scale outdoor scenes) and 7 Scenes (indoor scenes) using various CNN feature extractors. Using the same feature extractor, our method improves the median error on both indoor and outdoor localization datasets compared to the previous best deep learning model, PoseNet (with geometric re-projection loss). In the specific case of the Street scene, we improve the median localization error by over 8 m.
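
To make the anchor-point idea concrete, here is a minimal sketch of how a loss could discover the relevant anchor without a ground-truth label for it. It is an illustration under stated assumptions, not the paper's implementation: the function names (`softmax`, `anchor_weighted_loss`, `predict_position`) are hypothetical, only a 2D position is modeled, and the orientation terms of the full 6-DoF pose are omitted. The assumed mechanism is that each anchor's squared position error is weighted by the predicted confidence for that anchor, so minimizing the loss pushes confidence toward whichever anchor yields the most accurate offset.

```python
import math


def softmax(scores):
    """Convert raw anchor scores into confidences that sum to 1."""
    m = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def anchor_weighted_loss(anchors, anchor_scores, pred_offsets, gt_position):
    """Confidence-weighted squared position error, summed over all anchors.

    anchors       : list of (x, y) anchor locations (assumed 2D here)
    anchor_scores : raw classification scores, one per anchor
    pred_offsets  : list of (dx, dy) offsets predicted relative to each anchor
    gt_position   : ground-truth (x, y) camera location

    No anchor label is supplied: the only supervision is gt_position.
    """
    weights = softmax(anchor_scores)
    loss = 0.0
    for (ax, ay), (dx, dy), w in zip(anchors, pred_offsets, weights):
        err = (ax + dx - gt_position[0]) ** 2 + (ay + dy - gt_position[1]) ** 2
        loss += w * err
    return loss


def predict_position(anchors, anchor_scores, pred_offsets):
    """At inference, take the highest-scoring anchor and add its offset."""
    best = max(range(len(anchors)), key=lambda i: anchor_scores[i])
    ax, ay = anchors[best]
    dx, dy = pred_offsets[best]
    return (ax + dx, ay + dy)


if __name__ == "__main__":
    anchors = [(0, 0), (10, 0)]
    offsets = [(1, 1), (-2, 1)]   # the second anchor's offset lands on the target
    gt = (8, 1)
    print(anchor_weighted_loss(anchors, [0.0, 5.0], offsets, gt))
    print(predict_position(anchors, [0.0, 5.0], offsets))
```

In this toy setup the loss is smaller when the confidence favors the second anchor, whose offset reaches the ground-truth location, than when it favors the first. That is the sense in which the relevant anchor is "discovered" rather than labeled.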