Alert button

"Image": models, code, and papers
Alert button

Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry

Jul 24, 2023
Yong-Hyun Park, Mingi Kwon, Jaewoong Choi, Junghyo Jo, Youngjung Uh

Figure 1 for Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Figure 2 for Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Figure 3 for Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Figure 4 for Understanding the Latent Space of Diffusion Models through the Lens of Riemannian Geometry
Viaarxiv icon

A Systematic Survey of Prompt Engineering on Vision-Language Foundation Models

Add code
Bookmark button
Alert button
Jul 24, 2023
Jindong Gu, Zhen Han, Shuo Chen, Ahmad Beirami, Bailan He, Gengyuan Zhang, Ruotong Liao, Yao Qin, Volker Tresp, Philip Torr

Viaarxiv icon

Semantically Structured Image Compression via Irregular Group-Based Decoupling

May 04, 2023
Ruoyu Feng, Yixin Gao, Xin Jin, Runsen Feng, Zhibo Chen

Figure 1 for Semantically Structured Image Compression via Irregular Group-Based Decoupling
Figure 2 for Semantically Structured Image Compression via Irregular Group-Based Decoupling
Figure 3 for Semantically Structured Image Compression via Irregular Group-Based Decoupling
Figure 4 for Semantically Structured Image Compression via Irregular Group-Based Decoupling
Viaarxiv icon

Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning

May 24, 2023
Haoyi Qiu, Zi-Yi Dou, Tianlu Wang, Asli Celikyilmaz, Nanyun Peng

Figure 1 for Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning
Figure 2 for Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning
Figure 3 for Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning
Figure 4 for Gender Biases in Automatic Evaluation Metrics: A Case Study on Image Captioning
Viaarxiv icon

GenKL: An Iterative Framework for Resolving Label Ambiguity and Label Non-conformity in Web Images Via a New Generalized KL Divergence

Add code
Bookmark button
Alert button
Jul 19, 2023
Xia Huang, Kai Fong Ernest Chong

Viaarxiv icon

LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization

Apr 25, 2023
Sheng Liu, Cong Phuoc Huynh, Cong Chen, Maxim Arap, Raffay Hamid

Figure 1 for LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Figure 2 for LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Figure 3 for LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Figure 4 for LEMaRT: Label-Efficient Masked Region Transform for Image Harmonization
Viaarxiv icon

Causal reasoning in typical computer vision tasks

Jul 31, 2023
Kexuan Zhang, Qiyu Sun, Chaoqiang Zhao, Yang Tang

Figure 1 for Causal reasoning in typical computer vision tasks
Figure 2 for Causal reasoning in typical computer vision tasks
Figure 3 for Causal reasoning in typical computer vision tasks
Figure 4 for Causal reasoning in typical computer vision tasks
Viaarxiv icon

DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields

Add code
Bookmark button
Alert button
Jul 31, 2023
Cheng-You Lu, Peisen Zhou, Angela Xing, Chandradeep Pokhariya, Arnab Dey, Ishaan Shah, Rugved Mavidipalli, Dylan Hu, Andrew Comport, Kefan Chen, Srinath Sridhar

Figure 1 for DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields
Figure 2 for DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields
Figure 3 for DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields
Figure 4 for DiVA-360: The Dynamic Visuo-Audio Dataset for Immersive Neural Fields
Viaarxiv icon

Detecting diabetic retinopathy severity through fundus images using an ensemble of classifiers

Jul 31, 2023
Eduard Popescu, Adrian Groza, Ioana Damian

Figure 1 for Detecting diabetic retinopathy severity through fundus images using an ensemble of classifiers
Figure 2 for Detecting diabetic retinopathy severity through fundus images using an ensemble of classifiers
Figure 3 for Detecting diabetic retinopathy severity through fundus images using an ensemble of classifiers
Figure 4 for Detecting diabetic retinopathy severity through fundus images using an ensemble of classifiers
Viaarxiv icon

Exposing the Fake: Effective Diffusion-Generated Images Detection

Jul 12, 2023
Ruipeng Ma, Jinhao Duan, Fei Kong, Xiaoshuang Shi, Kaidi Xu

Figure 1 for Exposing the Fake: Effective Diffusion-Generated Images Detection
Figure 2 for Exposing the Fake: Effective Diffusion-Generated Images Detection
Figure 3 for Exposing the Fake: Effective Diffusion-Generated Images Detection
Figure 4 for Exposing the Fake: Effective Diffusion-Generated Images Detection
Viaarxiv icon