Alert button

"Image": models, code, and papers
Alert button

Using Stochastic Gradient Descent to Smooth Nonconvex Functions: Analysis of Implicit Graduated Optimization with Optimal Noise Scheduling

Nov 29, 2023
Naoki Sato, Hideaki Iiduka

Viaarxiv icon

Manipulation Mask Generator: High-Quality Image Manipulation Mask Generation Method Based on Modified Total Variation Noise Reduction

Oct 23, 2023
Xinyu Yang, Jizhe Zhou

Viaarxiv icon

MAIRA-1: A specialised large multimodal model for radiology report generation

Nov 22, 2023
Stephanie L. Hyland, Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Mercy Ranjit, Anton Schwaighofer, Fernando Pérez-García, Valentina Salvatelli, Shaury Srivastav, Anja Thieme, Noel Codella, Matthew P. Lungren, Maria Teodora Wetscherek, Ozan Oktay, Javier Alvarez-Valle

Viaarxiv icon

Panda or not Panda? Understanding Adversarial Attacks with Interactive Visualization

Nov 22, 2023
Yuzhe You, Jarvis Tse, Jian Zhao

Viaarxiv icon

Guided Flows for Generative Modeling and Decision Making

Nov 22, 2023
Qinqing Zheng, Matt Le, Neta Shaul, Yaron Lipman, Aditya Grover, Ricky T. Q. Chen

Viaarxiv icon

PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis

Add code
Bookmark button
Alert button
Oct 16, 2023
Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James Kwok, Ping Luo, Huchuan Lu, Zhenguo Li

Figure 1 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 2 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 3 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Figure 4 for PixArt-$α$: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
Viaarxiv icon

MAM-E: Mammographic synthetic image generation with diffusion models

Nov 16, 2023
Ricardo Montoya-del-Angel, Karla Sam-Millan, Joan C Vilanova, Robert Martí

Viaarxiv icon

From Denoising Training to Test-Time Adaptation: Enhancing Domain Generalization for Medical Image Segmentation

Add code
Bookmark button
Alert button
Nov 03, 2023
Ruxue Wen, Hangjie Yuan, Dong Ni, Wenbo Xiao, Yaoyao Wu

Viaarxiv icon

TeG-DG: Textually Guided Domain Generalization for Face Anti-Spoofing

Nov 30, 2023
Lianrui Mu, Jianhong Bai, Xiaoxuan He, Jiangnan Ye, Xiaoyu Liang, Yuchen Yang, Jiedong Zhuang, Haoji Hu

Viaarxiv icon

SPiC-E : Structural Priors in 3D Diffusion Models using Cross-Entity Attention

Add code
Bookmark button
Alert button
Nov 30, 2023
Etai Sella, Gal Fiebelman, Noam Atia, Hadar Averbuch-Elor

Viaarxiv icon