"Image": models, code, and papers

Denoising Diffusion Probabilistic Models for Robust Image Super-Resolution in the Wild

Feb 15, 2023
Hshmat Sahak, Daniel Watson, Chitwan Saharia, David Fleet

TcGAN: Semantic-Aware and Structure-Preserved GANs with Individual Vision Transformer for Fast Arbitrary One-Shot Image Generation

Feb 16, 2023
Yunliang Jiang, Lili Yan, Xiongtao Zhang, Yong Liu, Danfeng Sun

CLIP4STR: A Simple Baseline for Scene Text Recognition with Pre-trained Vision-Language Model

May 23, 2023
Shuai Zhao, Xiaohan Wang, Linchao Zhu, Yi Yang

ConGraT: Self-Supervised Contrastive Pretraining for Joint Graph and Text Embeddings

May 23, 2023
William Brannon, Suyash Fulay, Hang Jiang, Wonjune Kang, Brandon Roy, Jad Kabbara, Deb Roy

REC-MV: REconstructing 3D Dynamic Cloth from Monocular Videos

May 23, 2023
Lingteng Qiu, Guanying Chen, Jiapeng Zhou, Mutian Xu, Junle Wang, Xiaoguang Han

Design and Operation of Autonomous Wheelchair Towing Robot

May 23, 2023
Hyunwoo Kang, Jaeho Shin, Jaewook Shin, Youngseok Jang, Seung Jae Lee

Learning Remote Sensing Object Detection with Single Point Supervision

May 23, 2023
Shitian He, Huanxin Zou, Yingqian Wang, Boyang Li, Xu Cao, Ning Jing

Multi-object Video Generation from Single Frame Layouts

May 06, 2023
Yang Wu, Zhibin Liu, Hefeng Wu, Liang Lin

Advances and Challenges in Multimodal Remote Sensing Image Registration

Feb 05, 2023
Bai Zhu, Liang Zhou, Simiao Pu, Jianwei Fan, Yuanxin Ye

MASK-CNN-Transformer For Real-Time Multi-Label Weather Recognition

Apr 28, 2023
Shengchao Chen, Ting Shu, Huan Zhao, Yuan Yan Tan
