Alert button

"Image": models, code, and papers
Alert button

Federated Self-Supervised Learning of Monocular Depth Estimators for Autonomous Vehicles

Oct 07, 2023
Elton F. de S. Soares, Carlos Alberto V. Campos

Viaarxiv icon

TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields

Sep 29, 2023
Tianyu Huang, Yihan Zeng, Bowen Dong, Hang Xu, Songcen Xu, Rynson W. H. Lau, Wangmeng Zuo

Figure 1 for TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Figure 2 for TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Figure 3 for TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Figure 4 for TextField3D: Towards Enhancing Open-Vocabulary 3D Generation with Noisy Text Fields
Viaarxiv icon

BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning

Sep 26, 2023
Ching-Yu Chiang, I-Hua Chang, Shih-Wei Liao

Figure 1 for BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Figure 2 for BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Figure 3 for BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Figure 4 for BLIP-Adapter: Parameter-Efficient Transfer Learning for Mobile Screenshot Captioning
Viaarxiv icon

A novel approach for holographic 3D content generation without depth map

Sep 26, 2023
Hakdong Kim, Minkyu Jee, Yurim Lee, Kyudam Choi, MinSung Yoon, Cheongwon Kim

Viaarxiv icon

Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models

Oct 09, 2023
Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal

Figure 1 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 2 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 3 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Figure 4 for Rephrase, Augment, Reason: Visual Grounding of Questions for Vision-Language Models
Viaarxiv icon

SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning

Aug 17, 2023
Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li

Figure 1 for SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
Figure 2 for SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
Figure 3 for SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
Figure 4 for SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning
Viaarxiv icon

Dual Prompt Tuning for Domain-Aware Federated Learning

Oct 04, 2023
Guoyizhe Wei, Feng Wang, Anshul Shah, Rama Chellappa

Viaarxiv icon

PDR-CapsNet: an Energy-Efficient Parallel Approach to Dynamic Routing in Capsule Networks

Oct 04, 2023
Samaneh Javadinia, Amirali Baniasadi

Viaarxiv icon

Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy

Sep 07, 2023
Yi Tang, Takafumi Iwaguchi, Hiroshi Kawasaki

Figure 1 for Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Figure 2 for Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Figure 3 for Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Figure 4 for Underwater Image Enhancement by Transformer-based Diffusion Model with Non-uniform Sampling for Skip Strategy
Viaarxiv icon

MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings

Sep 30, 2023
Lei Yang, Jiaxin Yu, Xinyu Zhang, Jun Li, Li Wang, Yi Huang, Chuang Zhang, Hong Wang, Yiming Li

Figure 1 for MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings
Figure 2 for MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings
Figure 3 for MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings
Figure 4 for MonoGAE: Roadside Monocular 3D Object Detection with Ground-Aware Embeddings
Viaarxiv icon