Alert button

"Image": models, code, and papers
Alert button

Muse: Text-To-Image Generation via Masked Generative Transformers

Add code
Bookmark button
Alert button
Jan 02, 2023
Huiwen Chang, Han Zhang, Jarred Barber, AJ Maschinot, Jose Lezama, Lu Jiang, Ming-Hsuan Yang, Kevin Murphy, William T. Freeman, Michael Rubinstein, Yuanzhen Li, Dilip Krishnan

Figure 1 for Muse: Text-To-Image Generation via Masked Generative Transformers
Figure 2 for Muse: Text-To-Image Generation via Masked Generative Transformers
Figure 3 for Muse: Text-To-Image Generation via Masked Generative Transformers
Figure 4 for Muse: Text-To-Image Generation via Masked Generative Transformers
Viaarxiv icon

What Affects Learned Equivariance in Deep Image Recognition Models?

Apr 07, 2023
Robert-Jan Bruintjes, Tomasz Motyka, Jan van Gemert

Figure 1 for What Affects Learned Equivariance in Deep Image Recognition Models?
Figure 2 for What Affects Learned Equivariance in Deep Image Recognition Models?
Figure 3 for What Affects Learned Equivariance in Deep Image Recognition Models?
Figure 4 for What Affects Learned Equivariance in Deep Image Recognition Models?
Viaarxiv icon

A Comprehensive Survey on Segment Anything Model for Vision and Beyond

Add code
Bookmark button
Alert button
May 19, 2023
Chunhui Zhang, Li Liu, Yawen Cui, Guanjie Huang, Weilin Lin, Yiqian Yang, Yuehong Hu

Figure 1 for A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Figure 2 for A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Figure 3 for A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Figure 4 for A Comprehensive Survey on Segment Anything Model for Vision and Beyond
Viaarxiv icon

AutoCoreset: An Automatic Practical Coreset Construction Framework

Add code
Bookmark button
Alert button
May 19, 2023
Alaa Maalouf, Murad Tukan, Vladimir Braverman, Daniela Rus

Figure 1 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 2 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 3 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Figure 4 for AutoCoreset: An Automatic Practical Coreset Construction Framework
Viaarxiv icon

Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery

Add code
Bookmark button
Alert button
May 19, 2023
Long Bai, Mobarakol Islam, Lalithkumar Seenivasan, Hongliang Ren

Figure 1 for Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Figure 2 for Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Figure 3 for Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Figure 4 for Surgical-VQLA: Transformer with Gated Vision-Language Embedding for Visual Question Localized-Answering in Robotic Surgery
Viaarxiv icon

PTQD: Accurate Post-Training Quantization for Diffusion Models

May 18, 2023
Yefei He, Luping Liu, Jing Liu, Weijia Wu, Hong Zhou, Bohan Zhuang

Figure 1 for PTQD: Accurate Post-Training Quantization for Diffusion Models
Figure 2 for PTQD: Accurate Post-Training Quantization for Diffusion Models
Figure 3 for PTQD: Accurate Post-Training Quantization for Diffusion Models
Figure 4 for PTQD: Accurate Post-Training Quantization for Diffusion Models
Viaarxiv icon

Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations

May 18, 2023
Lyle Regenwetter, Yazan Abu Obaideh, Faez Ahmed

Figure 1 for Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations
Figure 2 for Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations
Figure 3 for Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations
Figure 4 for Counterfactuals for Design: A Model-Agnostic Method For Design Recommendations
Viaarxiv icon

Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping

Apr 17, 2023
Long Lian, Zhirong Wu, Stella X. Yu

Figure 1 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 2 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 3 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Figure 4 for Bootstrapping Objectness from Videos by Relaxed Common Fate and Visual Grouping
Viaarxiv icon

Invariant Scattering Transform for Medical Imaging

Apr 20, 2023
Md Manjurul Ahsan, Shivakumar Raman, Zahed Siddique

Figure 1 for Invariant Scattering Transform for Medical Imaging
Figure 2 for Invariant Scattering Transform for Medical Imaging
Figure 3 for Invariant Scattering Transform for Medical Imaging
Figure 4 for Invariant Scattering Transform for Medical Imaging
Viaarxiv icon

Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis

Nov 28, 2022
Raunak Joshi, Abhishek Gupta, Nandan Kanvinde, Pandharinath Ghonge

Figure 1 for Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis
Figure 2 for Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis
Figure 3 for Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis
Figure 4 for Forged Image Detection using SOTA Image Classification Deep Learning Methods for Image Forensics with Error Level Analysis
Viaarxiv icon