"Image": models, code, and papers

VIGC: Visual Instruction Generation and Correction

Sep 11, 2023
Bin Wang, Fan Wu, Xiao Han, Jiahui Peng, Huaping Zhong, Pan Zhang, Xiaoyi Dong, Weijia Li, Wei Li, Jiaqi Wang, Conghui He

Via arXiv

Sem-CS: Semantic CLIPStyler for Text-Based Image Style Transfer

Jul 12, 2023
Chanda Grover Kamra, Indra Deep Mastan, Debayan Gupta

Via arXiv

Parameter-Efficient Long-Tailed Recognition

Sep 18, 2023
Jiang-Xin Shi, Tong Wei, Zhi Zhou, Xin-Yan Han, Jie-Jing Shao, Yu-Feng Li

Via arXiv

Face-Driven Zero-Shot Voice Conversion with Memory-based Face-Voice Alignment

Sep 18, 2023
Zheng-Yan Sheng, Yang Ai, Yan-Nian Chen, Zhen-Hua Ling

Via arXiv

Can you text what is happening? Integrating pre-trained language encoders into trajectory prediction models for autonomous driving

Sep 13, 2023
Ali Keysan, Andreas Look, Eitan Kosman, Gonca Gürsun, Jörg Wagner, Yu Yao, Barbara Rakitsch

Via arXiv

Feature Modulation Transformer: Cross-Refinement of Global Representation via High-Frequency Prior for Image Super-Resolution

Aug 12, 2023
Ao Li, Le Zhang, Yun Liu, Ce Zhu

Via arXiv

A Novel Truncated Norm Regularization Method for Multi-channel Color Image Denoising

Jul 16, 2023
Yiwen Shan, Dong Hu, Haoming Ding, Chunming Yang, Zhi Wang

Via arXiv

Diffusion Models for Interferometric Satellite Aperture Radar

Aug 31, 2023
Alexandre Tuel, Thomas Kerdreux, Claudia Hulbert, Bertrand Rouet-Leduc

Via arXiv

Latent Painter

Sep 01, 2023
Shih-Chieh Su

Via arXiv

Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type

Jul 25, 2023
Romy Müller, Marcel Duerschmidt, Julian Ullrich, Carsten Knoll, Sascha Weber, Steffen Seitz

Via arXiv