"Image": models, code, and papers

Towards Robust Prompts on Vision-Language Models

Apr 17, 2023
Jindong Gu, Ahmad Beirami, Xuezhi Wang, Alex Beutel, Philip Torr, Yao Qin

Via arXiv

Visual Instruction Tuning

Apr 17, 2023
Haotian Liu, Chunyuan Li, Qingyang Wu, Yong Jae Lee

Via arXiv

Generative Disco: Text-to-Video Generation for Music Visualization

Apr 17, 2023
Vivian Liu, Tao Long, Nathan Raw, Lydia Chilton

Via arXiv

FaceQAN: Face Image Quality Assessment Through Adversarial Noise Exploration

Dec 05, 2022
Žiga Babnik, Peter Peer, Vitomir Štruc

Via arXiv

Bent & Broken Bicycles: Leveraging synthetic data for damaged object re-identification

Apr 16, 2023
Luca Piano, Filippo Gabriele Pratticò, Alessandro Sebastian Russo, Lorenzo Lanari, Lia Morra, Fabrizio Lamberti

Via arXiv

Optimization of Image Transmission in a Cooperative Semantic Communication Networks

Jan 01, 2023
Wenjing Zhang, Yining Wang, Mingzhe Chen, Tao Luo, Dusit Niyato

Via arXiv

NaviNeRF: NeRF-based 3D Representation Disentanglement by Latent Semantic Navigation

Apr 22, 2023
Baao Xie, Bohan Li, Zequn Zhang, Junting Dong, Xin Jin, Jingyu Yang, Wenjun Zeng

Via arXiv

OTS: A One-shot Learning Approach for Text Spotting in Historical Manuscripts

Apr 03, 2023
Wen-Bo Hu, Hong-Jian Zhan, Cong Liu, Bing Yin, Yue Lu

Via arXiv

Large-Scale Bidirectional Training for Zero-Shot Image Captioning

Nov 15, 2022
Taehoon Kim, Mark Marsden, Pyunghwan Ahn, Sangyun Kim, Sihaeng Lee, Alessandra Sala, Seung Hwan Kim

Via arXiv

Self-Supervised Pre-training of 3D Point Cloud Networks with Image Data

Dec 13, 2022
Andrej Janda, Brandon Wagstaff, Edwin G. Ng, Jonathan Kelly

Via arXiv