Alert button

"Image": models, code, and papers
Alert button

Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models

Oct 02, 2023
Hyeonho Jeong, Jong Chul Ye

Figure 1 for Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Figure 2 for Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Figure 3 for Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Figure 4 for Ground-A-Video: Zero-shot Grounded Video Editing using Text-to-image Diffusion Models
Viaarxiv icon

Neural architecture impact on identifying temporally extended Reinforcement Learning tasks

Oct 04, 2023
Victor Vadakechirayath George

Viaarxiv icon

Bridging Distribution Learning and Image Clustering in High-dimensional Space

Aug 29, 2023
Guanfang Dong, Chenqiu Zhao, Anup Basu

Figure 1 for Bridging Distribution Learning and Image Clustering in High-dimensional Space
Figure 2 for Bridging Distribution Learning and Image Clustering in High-dimensional Space
Figure 3 for Bridging Distribution Learning and Image Clustering in High-dimensional Space
Figure 4 for Bridging Distribution Learning and Image Clustering in High-dimensional Space
Viaarxiv icon

On the Localization of Ultrasound Image Slices within Point Distribution Models

Sep 01, 2023
Lennart Bastian, Vincent Bürgin, Ha Young Kim, Alexander Baumann, Benjamin Busam, Mahdi Saleh, Nassir Navab

Figure 1 for On the Localization of Ultrasound Image Slices within Point Distribution Models
Figure 2 for On the Localization of Ultrasound Image Slices within Point Distribution Models
Figure 3 for On the Localization of Ultrasound Image Slices within Point Distribution Models
Viaarxiv icon

Echocardiography video synthesis from end diastolic semantic map via diffusion model

Oct 11, 2023
Phi Nguyen Van, Duc Tran Minh, Hieu Pham Huy, Long Tran Quoc

Figure 1 for Echocardiography video synthesis from end diastolic semantic map via diffusion model
Figure 2 for Echocardiography video synthesis from end diastolic semantic map via diffusion model
Figure 3 for Echocardiography video synthesis from end diastolic semantic map via diffusion model
Figure 4 for Echocardiography video synthesis from end diastolic semantic map via diffusion model
Viaarxiv icon

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Figure 1 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 2 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 3 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 4 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Viaarxiv icon

Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet

Oct 05, 2023
Hossein Jafari, Karim Faez, Hamidreza Amindavar

Figure 1 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 2 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 3 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Figure 4 for Swin-Tempo: Temporal-Aware Lung Nodule Detection in CT Scans as Video Sequences Using Swin Transformer-Enhanced UNet
Viaarxiv icon

Text-driven Prompt Generation for Vision-Language Models in Federated Learning

Oct 09, 2023
Chen Qiu, Xingyu Li, Chaithanya Kumar Mummadi, Madan Ravi Ganesh, Zhenzhen Li, Lu Peng, Wan-Yi Lin

Figure 1 for Text-driven Prompt Generation for Vision-Language Models in Federated Learning
Figure 2 for Text-driven Prompt Generation for Vision-Language Models in Federated Learning
Figure 3 for Text-driven Prompt Generation for Vision-Language Models in Federated Learning
Figure 4 for Text-driven Prompt Generation for Vision-Language Models in Federated Learning
Viaarxiv icon

Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation

Aug 31, 2023
Reza Azad, Leon Niggemeier, Michael Huttemann, Amirhossein Kazerouni, Ehsan Khodapanah Aghdam, Yury Velichko, Ulas Bagci, Dorit Merhof

Figure 1 for Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation
Figure 2 for Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation
Figure 3 for Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation
Figure 4 for Beyond Self-Attention: Deformable Large Kernel Attention for Medical Image Segmentation
Viaarxiv icon

A Simple Text to Video Model via Transformer

Sep 26, 2023
Gang Chen

Viaarxiv icon