72010604

Electronic Imaging

2470-1173

Society for Imaging Science and Technology

7003 Kilworth Lane, Springfield, VA 22151 USA

10.2352/ISSN.2470-1173.2020.10.IPAS-313

2470-1173(20200126)2020:10L.3131;1-

Articles

Multiscale Convolutional Descriptor Aggregation for Visual Place Recognition

Raffaele Imbriaco

Egor Bondarev

Peter H.N. de With

26 January 2020

Volume 2020, Issue 10, pp. 313-1 to 313-7

Visual place recognition using query and database images from different sources remains a challenging task in computer vision. Our method exploits global descriptors for efficient image matching and local descriptors for geometric verification. We present a novel multi-scale aggregation method for local convolutional descriptors, using memory vector construction for efficient aggregation. The method makes it possible to find a preliminary set of candidate image matches and to remove visually similar but erroneous candidates. We deploy the multi-scale aggregation for visual place recognition on three large-scale datasets. We obtain a Recall@10 above 94% on the Pittsburgh dataset, outperforming other popular convolutional descriptors used in image retrieval and place recognition. Additionally, we provide a comparison of these descriptors on a more challenging dataset containing query and database images obtained from different sources, achieving over 77% Recall@10.

Deep learning; Visual place recognition; Local descriptors
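
The abstract reports results as Recall@10. As a rough illustration of how such a figure is commonly computed in place recognition, the sketch below counts a query as correct when at least one true match appears among its N highest-ranked database images. This is an assumption about the evaluation protocol, not a statement of the authors' code: the function names `cosine` and `recall_at_n`, the use of cosine similarity on global descriptors, and the toy data are all hypothetical.

```python
import math


def cosine(a, b):
    """Cosine similarity between two descriptor vectors (assumed non-zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def recall_at_n(query_desc, db_desc, ground_truth, n=10):
    """Fraction of queries with at least one true match in the top n results.

    query_desc   -- list of query descriptors (hypothetical global descriptors)
    db_desc      -- list of database descriptors
    ground_truth -- per query, the set of database indices that are true matches
    """
    hits = 0
    for q, positives in zip(query_desc, ground_truth):
        # Rank database images from most to least similar to the query.
        ranked = sorted(
            range(len(db_desc)),
            key=lambda i: cosine(q, db_desc[i]),
            reverse=True,
        )
        # The query counts as a hit if any of its top-n results is a true match.
        if any(i in positives for i in ranked[:n]):
            hits += 1
    return hits / len(query_desc)


# Toy data: four database images and two queries (illustrative values only).
db = [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 1, 0], [0, 0, 0, 1]]
queries = [[1, 0.1, 0, 0], [0, 0, 1, 0.2]]
truth = [{0}, {3}]

r1 = recall_at_n(queries, db, truth, n=1)
r2 = recall_at_n(queries, db, truth, n=2)
```

In this toy setup the second query ranks its true match second, so it is missed at N = 1 and counted at N = 2. In a two-stage pipeline such as the one the abstract outlines, the ranking would be taken after any geometric verification step has filtered the candidate list.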