Alert button

"Image": models, code, and papers
Alert button

Towards Label-Efficient Human Matting: A Simple Baseline for Weakly Semi-Supervised Trimap-Free Human Matting

Add code
Bookmark button
Alert button
Apr 01, 2024
Beomyoung Kim, Myeong Yeon Yi, Joonsang Yu, Young Joon Yoo, Sung Ju Hwang

Viaarxiv icon

Can Biases in ImageNet Models Explain Generalization?

Add code
Bookmark button
Alert button
Apr 01, 2024
Paul Gavrikov, Janis Keuper

Viaarxiv icon

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Add code
Bookmark button
Alert button
Mar 18, 2024
Vikram Voleti, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitry Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani

Figure 1 for SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Figure 2 for SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Figure 3 for SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Figure 4 for SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion
Viaarxiv icon

ModaLink: Unifying Modalities for Efficient Image-to-PointCloud Place Recognition

Add code
Bookmark button
Alert button
Mar 27, 2024
Weidong Xie, Lun Luo, Nanfei Ye, Yi Ren, Shaoyi Du, Minhang Wang, Jintao Xu, Rui Ai, Weihao Gu, Xieyuanli Chen

Viaarxiv icon

Anytime, Anywhere, Anyone: Investigating the Feasibility of Segment Anything Model for Crowd-Sourcing Medical Image Annotations

Mar 22, 2024
Pranav Kulkarni, Adway Kanhere, Dharmam Savani, Andrew Chan, Devina Chatterjee, Paul H. Yi, Vishwa S. Parekh

Viaarxiv icon

Visual Hallucination: Definition, Quantification, and Prescriptive Remediations

Mar 31, 2024
Anku Rani, Vipula Rawte, Harshad Sharma, Neeraj Anand, Krishnav Rajbangshi, Amit Sheth, Amitava Das

Viaarxiv icon

Imperceptible Protection against Style Imitation from Diffusion Models

Mar 28, 2024
Namhyuk Ahn, Wonhyuk Ahn, KiYoon Yoo, Daesik Kim, Seung-Hun Nam

Viaarxiv icon

E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance

Add code
Bookmark button
Alert button
Mar 15, 2024
Tianrui Huang, Pu Cao, Lu Yang, Chun Liu, Mengjie Hu, Zhiwei Liu, Qing Song

Figure 1 for E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Figure 2 for E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Figure 3 for E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Figure 4 for E4C: Enhance Editability for Text-Based Image Editing by Harnessing Efficient CLIP Guidance
Viaarxiv icon

Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder

Mar 15, 2024
Jinseok Kim, Tae-Kyun Kim

Figure 1 for Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder
Figure 2 for Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder
Figure 3 for Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder
Figure 4 for Arbitrary-Scale Image Generation and Upsampling using Latent Diffusion Model and Implicit Neural Decoder
Viaarxiv icon

AAPMT: AGI Assessment Through Prompt and Metric Transformer

Add code
Bookmark button
Alert button
Mar 28, 2024
Benhao Huang

Viaarxiv icon