Alert button

"Image": models, code, and papers
Alert button

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Mar 21, 2024
Max Ku, Cong Wei, Weiming Ren, Huan Yang, Wenhu Chen

Figure 1 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 2 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 3 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 4 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Viaarxiv icon

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

Add code
Bookmark button
Alert button
Mar 21, 2024
Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 2 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 3 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 4 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Viaarxiv icon

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

Mar 22, 2024
Xiang Fan, Anand Bhattad, Ranjay Krishna

Figure 1 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 2 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 3 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 4 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Viaarxiv icon

Deployment of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

Mar 22, 2024
Chun Kit Wong, Mary Ngo, Manxi Lin, Zahra Bashir, Amihai Heen, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

Viaarxiv icon

Cross-Domain Image Conversion by CycleDM

Add code
Bookmark button
Alert button
Mar 05, 2024
Sho Shimotsumagari, Shumpei Takezaki, Daichi Haraguchi, Seiichi Uchida

Figure 1 for Cross-Domain Image Conversion by CycleDM
Figure 2 for Cross-Domain Image Conversion by CycleDM
Figure 3 for Cross-Domain Image Conversion by CycleDM
Figure 4 for Cross-Domain Image Conversion by CycleDM
Viaarxiv icon

Exploiting Priors from 3D Diffusion Models for RGB-Based One-Shot View Planning

Mar 25, 2024
Sicong Pan, Liren Jin, Xuying Huang, Cyrill Stachniss, Marija Popović, Maren Bennewitz

Viaarxiv icon

Progressive trajectory matching for medical dataset distillation

Mar 20, 2024
Zhen Yu, Yang Liu, Qingchao Chen

Figure 1 for Progressive trajectory matching for medical dataset distillation
Figure 2 for Progressive trajectory matching for medical dataset distillation
Figure 3 for Progressive trajectory matching for medical dataset distillation
Figure 4 for Progressive trajectory matching for medical dataset distillation
Viaarxiv icon

IRConStyle: Image Restoration Framework Using Contrastive Learning and Style Transfer

Add code
Bookmark button
Alert button
Mar 07, 2024
Dongqi Fan, Xin Zhao, Liang Chang

Viaarxiv icon

Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation

Add code
Bookmark button
Alert button
Mar 14, 2024
Yunhao Gou, Kai Chen, Zhili Liu, Lanqing Hong, Hang Xu, Zhenguo Li, Dit-Yan Yeung, James T. Kwok, Yu Zhang

Figure 1 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 2 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 3 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Figure 4 for Eyes Closed, Safety On: Protecting Multimodal LLMs via Image-to-Text Transformation
Viaarxiv icon

Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data

Mar 21, 2024
Michael John Fanous, Paloma Casteleiro Costa, Cagatay Isil, Luzhe Huang, Aydogan Ozcan

Figure 1 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 2 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 3 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 4 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Viaarxiv icon