"Image": models, code, and papers

RT-SRTS: Angle-Agnostic Real-Time Simultaneous 3D Reconstruction and Tumor Segmentation from Single X-Ray Projection

Oct 12, 2023
Miao Zhu, Qiming Fu, Bo Liu, Mengxi Zhang, Bojian Li, Xiaoyan Luo, Fugen Zhou

Unified Language-Vision Pretraining in LLM with Dynamic Discrete Visual Tokenization

Sep 29, 2023
Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Chao Liao, Jianchao Tan, Quzhe Huang, Bin Chen, Chenyi Lei, An Liu, Chengru Song, Xiaoqiang Lei, Di Zhang, Wenwu Ou, Kun Gai, Yadong Mu

SatDM: Synthesizing Realistic Satellite Image with Semantic Layout Conditioning using Diffusion Models

Sep 28, 2023
Orkhan Baghirli, Hamid Askarov, Imran Ibrahimli, Ismat Bakhishov, Nabi Nabiyev

Generating 3D Brain Tumor Regions in MRI using Vector-Quantization Generative Adversarial Networks

Oct 02, 2023
Meng Zhou, Matthias W Wagner, Uri Tabori, Cynthia Hawkins, Birgit B Ertl-Wagner, Farzad Khalvati

Text-driven Prompt Generation for Vision-Language Models in Federated Learning

Oct 09, 2023
Chen Qiu, Xingyu Li, Chaithanya Kumar Mummadi, Madan Ravi Ganesh, Zhenzhen Li, Lu Peng, Wan-Yi Lin

The Role of Linguistic Priors in Measuring Compositional Generalization of Vision-Language Models

Oct 04, 2023
Chenwei Wu, Li Erran Li, Stefano Ermon, Patrick Haffner, Rong Ge, Zaiwei Zhang

MBIR Training for a 2.5D DL network in X-ray CT

Sep 23, 2023
Obaidullah Rahman, Madhuri Nagare, Ken D. Sauer, Charles A. Bouman, Roman Melnyk, Brian Nett, Jie Tang

Augmenting medical image classifiers with synthetic data from latent diffusion models

Aug 23, 2023
Luke W. Sagers, James A. Diao, Luke Melas-Kyriazi, Matthew Groh, Pranav Rajpurkar, Adewole S. Adamson, Veronica Rotemberg, Roxana Daneshjou, Arjun K. Manrai

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Echocardiography video synthesis from end diastolic semantic map via diffusion model

Oct 11, 2023
Phi Nguyen Van, Duc Tran Minh, Hieu Pham Huy, Long Tran Quoc
