"Image": models, code, and papers

Can GPT-4V(ision) Serve Medical Applications? Case Studies on GPT-4V for Multimodal Medical Diagnosis

Oct 17, 2023
Chaoyi Wu, Jiayu Lei, Qiaoyu Zheng, Weike Zhao, Weixiong Lin, Xiaoman Zhang, Xiao Zhou, Ziheng Zhao, Ya Zhang, Yanfeng Wang, Weidi Xie

Learning Neural Implicit through Volume Rendering with Attentive Depth Fusion Priors

Oct 17, 2023
Pengchong Hu, Zhizhong Han

Direct Inversion: Boosting Diffusion-based Editing with 3 Lines of Code

Oct 02, 2023
Xuan Ju, Ailing Zeng, Yuxuan Bian, Shaoteng Liu, Qiang Xu

MedSyn: Text-guided Anatomy-aware Synthesis of High-Fidelity 3D CT Images

Oct 18, 2023
Yanwu Xu, Li Sun, Wei Peng, Shyam Visweswaran, Kayhan Batmanghelich

Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture

Oct 18, 2023
Daniel Y. Fu, Simran Arora, Jessica Grogan, Isys Johnson, Sabri Eyuboglu, Armin W. Thomas, Benjamin Spector, Michael Poli, Atri Rudra, Christopher Ré

Impact of Image Context for Single Deep Learning Face Morphing Attack Detection

Sep 01, 2023
Joana Pimenta, Iurii Medvedev, Nuno Gonçalves

FIRE: Food Image to REcipe generation

Aug 28, 2023
Prateek Chhikara, Dhiraj Chaurasia, Yifan Jiang, Omkar Masur, Filip Ilievski

Beyond Segmentation: Road Network Generation with Multi-Modal LLMs

Oct 15, 2023
Sumedh Rasal, Sanjay Kumar Boddhu

Estimating Uncertainty in Multimodal Foundation Models using Public Internet Data

Oct 15, 2023
Shiladitya Dutta, Hongbo Wei, Lars van der Laan, Ahmed M. Alaa

Does resistance to Style-Transfer equal Shape Bias? Evaluating Shape Bias by Distorted Shape

Oct 11, 2023
Ziqi Wen, Tianqin Li, Tai Sing Lee
