Journal
IEEE ROBOTICS AND AUTOMATION LETTERS
Volume 6, Issue 4, Pages 8317-8324Publisher
IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/LRA.2021.3107024
Keywords
Visual learning; representation learning; SfM; localization; mapping
Categories
Funding
- Doctorado Industrial Grant [DI-16-08966]
- MINECO [TIN2016-75982-C2-2-R]
Ask authors/readers for more resources
New binary image descriptors, BAD and HashSIFT, offer a balance between accuracy and resource consumption, with BAD being the fastest implementation and HashSIFT showing high accuracy and computational efficiency. Public source code is available for further research and development.
The advent of a panoply of resource limited devices opens up new challenges in the design of computer vision algorithms with a clear compromise between accuracy and computational requirements. In this letter we present new binary image descriptors that emerge from the application of triplet ranking loss, hard negative mining and anchor swapping to traditional features based on pixel differences and image gradients. These descriptors, BAD (Box Average Difference) and HashSIFT, establish new operating points in the state-of-the-art's accuracy vs. resources trade-off curve. In our experiments we evaluate the accuracy, execution time and energy consumption of the proposed descriptors. We show that BAD bears the fastest descriptor implementation in the literature while HashSIFT approaches in accuracy that of the top deep learning-based descriptors, being computationally more efficient. We have made the source code public(1).
Authors
I am an author on this paper
Click your name to claim this paper and add it to your profile.
Reviews
Recommended
No Data Available