"Image": models, code, and papers

Improving Medical Report Generation with Adapter Tuning and Knowledge Enhancement in Vision-Language Foundation Models

Dec 07, 2023
Shibin Wu, Bang Yang, Zhiyu Ye, Haoqian Wang, Hairong Zheng, Tong Zhang

Dream2Real: Zero-Shot 3D Object Rearrangement with Vision-Language Models

Dec 07, 2023
Ivan Kapelyukh, Yifei Ren, Ignacio Alzugaray, Edward Johns

Guided Flows for Generative Modeling and Decision Making

Dec 07, 2023
Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

Disentangled Interaction Representation for One-Stage Human-Object Interaction Detection

Dec 04, 2023
Xubin Zhong, Changxing Ding, Yupeng Hu, Dacheng Tao

Can SAM recognize crops? Quantifying the zero-shot performance of a semantic segmentation foundation model on generating crop-type maps using satellite imagery for precision agriculture

Dec 04, 2023
Rutuja Gurav, Het Patel, Zhuocheng Shang, Ahmed Eldawy, Jia Chen, Elia Scudiero, Evangelos Papalexakis

Sam-Guided Enhanced Fine-Grained Encoding with Mixed Semantic Learning for Medical Image Captioning

Nov 02, 2023
Gaoang Wang, Zhenyu Zhang, Benlu Wang, Weijie Liang, Yizhi Li, Xuechen Guo, Guanhong Wang, Shiyan Li

Distance Weighted Trans Network for Image Completion

Oct 25, 2023
Pourya Shamsolmoali, Masoumeh Zareapoor, Huiyu Zhou, Xuelong Li, Yue Lu

High-Resolution Reference Image Assisted Volumetric Super-Resolution of Cardiac Diffusion Weighted Imaging

Oct 31, 2023
Yinzhe Wu, Jiahao Huang, Fanwen Wang, Pedro Ferreira, Andrew Scott, Sonia Nielles-Vallespin, Guang Yang

Augment the Pairs: Semantics-Preserving Image-Caption Pair Augmentation for Grounding-Based Vision and Language Models

Nov 05, 2023
Jingru Yi, Burak Uzkent, Oana Ignat, Zili Li, Amanmeet Garg, Xiang Yu, Linda Liu

Learning Anatomically Consistent Embedding for Chest Radiography

Dec 01, 2023
Ziyu Zhou, Haozhe Luo, Jiaxuan Pang, Xiaowei Ding, Michael Gotway, Jianming Liang
