Alert button

"Image": models, code, and papers
Alert button

Compressed image quality assessment using stacking

Feb 01, 2024
S. Farhad Hosseini-Benvidi, Hossein Motamednia, Azadeh Mansouri, Mohammadreza Raei, Ahmad Mahmoudi-Aznaveh

Viaarxiv icon

MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion

Feb 20, 2024
Sen Li, Ruochen Wang, Cho-Jui Hsieh, Minhao Cheng, Tianyi Zhou

Viaarxiv icon

Score-based generative models break the curse of dimensionality in learning a family of sub-Gaussian probability distributions

Feb 23, 2024
Frank Cole, Yulong Lu

Viaarxiv icon

SRNDiff: Short-term Rainfall Nowcasting with Condition Diffusion Model

Feb 21, 2024
Xudong Ling, Chaorong Li, Fengqing Qin, Peng Yang, Yuanyuan Huang

Viaarxiv icon

Dual-Path Coupled Image Deraining Network via Spatial-Frequency Interaction

Feb 07, 2024
Yuhong He, Aiwen Jiang, Lingfang Jiang, Zhifeng Wang, Lu Wang

Viaarxiv icon

Scene Prior Filtering for Depth Map Super-Resolution

Feb 23, 2024
Zhengxue Wang, Zhiqiang Yan, Ming-Hsuan Yang, Jinshan Pan, Jian Yang, Ying Tai, Guangwei Gao

Viaarxiv icon

Topological Analysis of Mouse Brain Vasculature via 3D Light-sheet Microscopy Images

Feb 23, 2024
Jiachen Yao, Nina Hagemann, Qiaojie Xiong, Jianxu Chen, Dirk M. Hermann, Chao Chen

Viaarxiv icon

Selective "Selective Prediction": Reducing Unnecessary Abstention in Vision-Language Reasoning

Feb 23, 2024
Tejas Srinivasan, Jack Hessel, Tanmay Gupta, Bill Yuchen Lin, Yejin Choi, Jesse Thomason, Khyathi Raghavi Chandu

Viaarxiv icon

BIKED++: A Multimodal Dataset of 1.4 Million Bicycle Image and Parametric CAD Designs

Feb 09, 2024
Lyle Regenwetter, Yazan Abu Obaideh, Amin Heyrani Nobari, Faez Ahmed

Viaarxiv icon

Noise Map Guidance: Inversion with Spatial Context for Real Image Editing

Feb 07, 2024
Hansam Cho, Jonghyun Lee, Seoung Bum Kim, Tae-Hyun Oh, Yonghyun Jeong

Viaarxiv icon