Yoshitaka Hara

Loop Closure using AnyLoc Visual Place Recognition in DPV-SLAM

Jan 06, 2026

Wenzheng Zhang, Kazuki Adachi, Yoshitaka Hara, Sousuke Nakamura

Abstract: Loop closure is crucial for maintaining the accuracy and consistency of visual SLAM. We propose a method to improve loop closure performance in DPV-SLAM. Our approach integrates AnyLoc, a learning-based visual place recognition technique, as a replacement for the classical Bag of Visual Words (BoVW) loop detection method. In contrast to BoVW, which relies on handcrafted features, AnyLoc uses deep feature representations, enabling more robust image retrieval across diverse viewpoints and lighting conditions. Furthermore, we propose an adaptive mechanism that dynamically adjusts the similarity threshold based on environmental conditions, removing the need for manual tuning. Experiments on both indoor and outdoor datasets demonstrate that our method significantly outperforms the original DPV-SLAM in terms of loop closure accuracy and robustness. The proposed method offers a practical and scalable solution for enhancing loop closure performance in modern SLAM systems.

* Accepted at IEEE/SICE International Symposium on System Integration (SII) 2026. 6 pages, 14 figures
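
To make the retrieval step in the abstract above concrete, here is a minimal sketch of descriptor-based loop detection with a threshold that adapts to the current similarity distribution. It is not the authors' implementation: the abstract does not specify the adaptive rule, so the rule used here (mean plus `k` standard deviations of the similarities, bounded below by `floor`) is an assumption, as are the names `detect_loop`, `min_gap`, `k`, and `floor`. The descriptors are plain lists standing in for the global descriptors a place recognition model such as AnyLoc would produce.

```python
import math
from statistics import mean, pstdev


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


def detect_loop(query, history, min_gap=2, k=1.0, floor=0.5):
    """Return (index, similarity) of the best loop candidate, or None.

    query   : global descriptor of the current keyframe
    history : descriptors of earlier keyframes, oldest first
    min_gap : hypothetical number of most recent keyframes to ignore,
              so that temporally adjacent frames are not reported as loops
    k, floor: hypothetical parameters of the assumed adaptive rule
    """
    eligible = history[:max(0, len(history) - min_gap)]
    if len(eligible) < 2:
        return None  # not enough past keyframes to estimate a threshold

    sims = [cosine(query, d) for d in eligible]

    # Assumed adaptive rule: in self-similar scenes all similarities are
    # high, so the threshold rises with them; the floor rejects matches
    # when nothing in the history resembles the query.
    threshold = max(floor, mean(sims) + k * pstdev(sims))

    best = max(range(len(sims)), key=sims.__getitem__)
    return (best, sims[best]) if sims[best] >= threshold else None
```

Calling `detect_loop(current_descriptor, past_descriptors)` returns the index of the matched keyframe, which a SLAM back end could then pass to geometric verification before adding a loop constraint. A relative threshold of this kind avoids a single hand-tuned constant, which is the motivation the abstract gives for adapting it.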

Topological Mapping and Navigation using a Monocular Camera based on AnyLoc

Jan 03, 2026

Wenzheng Zhang, Yoshitaka Hara, Sousuke Nakamura

Abstract: This paper proposes a method for topological mapping and navigation using a monocular camera. Based on AnyLoc, keyframes are converted into descriptors to construct topological relationships, enabling loop detection and map building. Unlike metric maps, topological maps simplify path planning and navigation by representing environments with key nodes instead of precise coordinates. Actions for visual navigation are determined by comparing segmented images with the image associated with target nodes. The system relies solely on a monocular camera, ensuring fast map building and navigation using key nodes. Experiments show effective loop detection and navigation in real and simulated environments without pre-training. Compared to a ResNet-based method, this approach improves success rates by 60.2% on average while reducing time and space costs, offering a lightweight solution for robot and human navigation in various scenarios.

* Published in Proc. IEEE International Conference on Automation Science and Engineering (CASE), 2025. 7 pages, 11 figures
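
The map-building idea in the abstract above (keyframes become nodes, descriptor similarity adds loop edges, and planning runs over nodes rather than coordinates) can be sketched as follows. This is an illustrative sketch, not the paper's method: the fixed `loop_threshold`, the `min_gap` parameter, the function names, and the use of breadth-first search for planning are all assumptions, and the image-comparison step that selects navigation actions is not modelled.

```python
import math
from collections import deque


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na > 0 and nb > 0 else 0.0


def build_topological_map(descriptors, loop_threshold=0.9, min_gap=2):
    """Build an undirected graph in which node i is keyframe i.

    Consecutive keyframes are always linked. Keyframes more than
    `min_gap` apart are linked when their descriptors are similar
    enough, which plays the role of loop detection.
    """
    graph = {i: set() for i in range(len(descriptors))}
    for i in range(len(descriptors)):
        if i > 0:  # sequential edge to the previous keyframe
            graph[i].add(i - 1)
            graph[i - 1].add(i)
        for j in range(0, i - min_gap):  # candidate loop edges
            if cosine(descriptors[i], descriptors[j]) >= loop_threshold:
                graph[i].add(j)
                graph[j].add(i)
    return graph


def shortest_path(graph, start, goal):
    """Breadth-first search; returns a list of node indices or None."""
    parent = {start: None}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if node == goal:
            path = []
            while node is not None:
                path.append(node)
                node = parent[node]
            return path[::-1]
        for nxt in graph[node]:
            if nxt not in parent:
                parent[nxt] = node
                queue.append(nxt)
    return None
```

Because the map stores only nodes and edges, planning reduces to a graph search, and a detected loop immediately shortens routes: if the last keyframe matches the first, a path from the start to a late keyframe can go through the loop edge instead of retracing the whole trajectory.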
