Get our free extension to see links to code for papers anywhere online!Free add-on: code for papers everywhere!Free add-on: See code for papers anywhere!

Add to Chrome

Add to Firefox

Add to Edge

Tatsuya Kawaguchi

PureCLIP-Depth: Prompt-Free and Decoder-Free Monocular Depth Estimation within CLIP Embedding Space

Mar 17, 2026

Ryutaro Miya, Kazuyoshi Fushinobu, Tatsuya Kawaguchi

Abstract:We propose PureCLIP-Depth, a completely prompt-free, decoder-free Monocular Depth Estimation (MDE) model that operates entirely within the Contrastive Language-Image Pre-training (CLIP) embedding space. Unlike recent models that rely heavily on geometric features, we explore a novel approach to MDE driven by conceptual information, performing computations directly within the conceptual CLIP space. The core of our method lies in learning a direct mapping from the RGB domain to the depth domain strictly inside this embedding space. Our approach achieves state-of-the-art performance among CLIP embedding-based models on both indoor and outdoor datasets. The code used in this research is available at: https://github.com/ryutaroLF/PureCLIP-Depth

* 12 pages, 4 figures

Via

Access Paper or Ask Questions

SoftNeuro: Fast Deep Inference using Multi-platform Optimization

Oct 12, 2021

Masaki Hilaga, Yasuhiro Kuroda, Hitoshi Matsuo, Tatsuya Kawaguchi, Gabriel Ogawa, Hiroshi Miyake, Yusuke Kozawa

Figure 1 for SoftNeuro: Fast Deep Inference using Multi-platform Optimization

Figure 2 for SoftNeuro: Fast Deep Inference using Multi-platform Optimization

Figure 3 for SoftNeuro: Fast Deep Inference using Multi-platform Optimization

Figure 4 for SoftNeuro: Fast Deep Inference using Multi-platform Optimization

Abstract:Faster inference of deep learning models is highly demanded on edge devices and even servers, for both financial and environmental reasons. To address this issue, we propose SoftNeuro, a novel, high-performance inference framework with efficient performance tuning. The key idea is to separate algorithmic routines from network layers. Our framework maximizes the inference performance by profiling various routines for each layer and selecting the fastest path. To efficiently find the best path, we propose a routine-selection algorithm based on dynamic programming. Experiments show that the proposed framework achieves both fast inference and efficient tuning.

* 8 pages, 3 figures

Via

Access Paper or Ask Questions