Alert button

"Image": models, code, and papers
Alert button

Who is bragging more online? A large scale analysis of bragging in social media

Mar 25, 2024
Mali Jin, Daniel Preoţiuc-Pietro, A. Seza Doğruöz, Nikolaos Aletras

Viaarxiv icon

MeaCap: Memory-Augmented Zero-shot Image Captioning

Add code
Bookmark button
Alert button
Mar 06, 2024
Zequn Zeng, Yan Xie, Hao Zhang, Chiyu Chen, Zhengjue Wang, Bo Chen

Figure 1 for MeaCap: Memory-Augmented Zero-shot Image Captioning
Figure 2 for MeaCap: Memory-Augmented Zero-shot Image Captioning
Figure 3 for MeaCap: Memory-Augmented Zero-shot Image Captioning
Figure 4 for MeaCap: Memory-Augmented Zero-shot Image Captioning
Viaarxiv icon

FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation

Add code
Bookmark button
Alert button
Mar 08, 2024
Yuxi Liu, Guibo Luo, Yuesheng Zhu

Figure 1 for FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
Figure 2 for FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
Figure 3 for FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
Figure 4 for FedFMS: Exploring Federated Foundation Models for Medical Image Segmentation
Viaarxiv icon

Deployment of Deep Learning Model in Real World Clinical Setting: A Case Study in Obstetric Ultrasound

Mar 22, 2024
Chun Kit Wong, Mary Ngo, Manxi Lin, Zahra Bashir, Amihai Heen, Morten Bo Søndergaard Svendsen, Martin Grønnebæk Tolsgaard, Anders Nymark Christensen, Aasa Feragen

Viaarxiv icon

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

Mar 22, 2024
Xiang Fan, Anand Bhattad, Ranjay Krishna

Figure 1 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 2 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 3 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Figure 4 for Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion
Viaarxiv icon

AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks

Mar 21, 2024
Max Ku, Cong Wei, Weiming Ren, Huan Yang, Wenhu Chen

Figure 1 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 2 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 3 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Figure 4 for AnyV2V: A Plug-and-Play Framework For Any Video-to-Video Editing Tasks
Viaarxiv icon

Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models

Add code
Bookmark button
Alert button
Mar 21, 2024
Zuyan Liu, Yuhao Dong, Yongming Rao, Jie Zhou, Jiwen Lu

Figure 1 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 2 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 3 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Figure 4 for Chain-of-Spot: Interactive Reasoning Improves Large Vision-Language Models
Viaarxiv icon

Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image

Mar 14, 2024
Yiqun Mei, Yu Zeng, He Zhang, Zhixin Shu, Xuaner Zhang, Sai Bi, Jianming Zhang, HyunJoon Jung, Vishal M. Patel

Figure 1 for Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
Figure 2 for Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
Figure 3 for Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
Figure 4 for Holo-Relighting: Controllable Volumetric Portrait Relighting from a Single Image
Viaarxiv icon

Self-supervised Photographic Image Layout Representation Learning

Add code
Bookmark button
Alert button
Mar 06, 2024
Zhaoran Zhao, Peng Lu, Xujun Peng, Wenhao Guo

Figure 1 for Self-supervised Photographic Image Layout Representation Learning
Figure 2 for Self-supervised Photographic Image Layout Representation Learning
Figure 3 for Self-supervised Photographic Image Layout Representation Learning
Figure 4 for Self-supervised Photographic Image Layout Representation Learning
Viaarxiv icon

Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data

Mar 21, 2024
Michael John Fanous, Paloma Casteleiro Costa, Cagatay Isil, Luzhe Huang, Aydogan Ozcan

Figure 1 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 2 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 3 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Figure 4 for Neural Network-Based Processing and Reconstruction of Compromised Biophotonic Image Data
Viaarxiv icon