"Image": models, code, and papers

BS-Diff: Effective Bone Suppression Using Conditional Diffusion Models from Chest X-Ray Images

Nov 26, 2023
Zhanghao Chen, Yifei Sun, Wenjian Qin, Ruiquan Ge, Cheng Pan, Wenming Deng, Zhou Liu, Wenwen Min, Ahmed Elazab, Xiang Wan, Changmiao Wang

Consistency Prototype Module and Motion Compensation for Few-Shot Action Recognition (CLIP-CP$\mathbf{M^2}$C)

Dec 02, 2023
Fei Guo, Li Zhu, YiKang Wang, Han Qi

StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D

Dec 02, 2023
Pengsheng Guo, Hans Hao, Adam Caccavale, Zhongzheng Ren, Edward Zhang, Qi Shan, Aditya Sankar, Alexander G. Schwing, Alex Colburn, Fangchang Ma

A Probabilistic Method to Predict Classifier Accuracy on Larger Datasets given Small Pilot Data

Nov 29, 2023
Ethan Harvey, Wansu Chen, David M. Kent, Michael C. Hughes

LICO: Explainable Models with Language-Image Consistency

Oct 15, 2023
Yiming Lei, Zilong Li, Yangyang Li, Junping Zhang, Hongming Shan

Importance of Feature Extraction in the Calculation of Fréchet Distance for Medical Imaging

Nov 22, 2023
McKell Woodland, Mais Al Taie, Jessica Albuquerque Marques Silva, Mohamed Eltaher, Frank Mohn, Alexander Shieh, Austin Castelo, Suprateek Kundu, Joshua P. Yung, Ankit B. Patel, Kristy K. Brock

PG-Video-LLaVA: Pixel Grounding Large Video-Language Models

Nov 22, 2023
Shehan Munasinghe, Rusiru Thushara, Muhammad Maaz, Hanoona Abdul Rasheed, Salman Khan, Mubarak Shah, Fahad Khan

AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort

Nov 19, 2023
Wen Wang, Canyu Zhao, Hao Chen, Zhekai Chen, Kecheng Zheng, Chunhua Shen

RePoseDM: Recurrent Pose Alignment and Gradient Guidance for Pose Guided Image Synthesis

Oct 24, 2023
Anant Khandelwal

CoDi-2: In-Context, Interleaved, and Interactive Any-to-Any Generation

Nov 30, 2023
Zineng Tang, Ziyi Yang, Mahmoud Khademi, Yang Liu, Chenguang Zhu, Mohit Bansal
