"Image": models, code, and papers

Fine-grained and Explainable Factuality Evaluation for Multimodal Summarization

Feb 18, 2024
Liqiang Jing, Jingxuan Zuo, Yue Zhang

Via arXiv

ARCNet: An Asymmetric Residual Wavelet Column Correction Network for Infrared Image Destriping

Jan 28, 2024
Shuai Yuan, Hanlin Qin, Xiang Yan, Naveed Akhtar, Shiqi Yang, Shuowen Yang

Via arXiv

The (R)Evolution of Multimodal Large Language Models: A Survey

Feb 19, 2024
Davide Caffagni, Federico Cocchi, Luca Barsellotti, Nicholas Moratelli, Sara Sarto, Lorenzo Baraldi, Lorenzo Baraldi, Marcella Cornia, Rita Cucchiara

Via arXiv

GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation

Feb 20, 2024
Ayan Banerjee, Sanket Biswas, Josep Lladós, Umapada Pal

Via arXiv

Only My Model On My Data: A Privacy Preserving Approach Protecting one Model and Deceiving Unauthorized Black-Box Models

Feb 14, 2024
Weiheng Chai, Brian Testa, Huantao Ren, Asif Salekin, Senem Velipasalar

Via arXiv

Extreme Video Compression with Pre-trained Diffusion Models

Feb 14, 2024
Bohan Li, Yiming Liu, Xueyan Niu, Bo Bai, Lei Deng, Deniz Gündüz

Via arXiv

Visually Dehallucinative Instruction Generation

Feb 13, 2024
Sungguk Cha, Jusung Lee, Younghyun Lee, Cheoljong Yang

Via arXiv

StyleInject: Parameter Efficient Tuning of Text-to-Image Diffusion Models

Jan 25, 2024
Yalong Bai, Mohan Zhou, Qing Yang

Via arXiv

A Multimodal In-Context Tuning Approach for E-Commerce Product Description Generation

Feb 21, 2024
Yunxin Li, Baotian Hu, Wenhan Luo, Lin Ma, Yuxin Ding, Min Zhang

Via arXiv

Feature Accentuation: Revealing 'What' Features Respond to in Natural Images

Feb 15, 2024
Chris Hamblin, Thomas Fel, Srijani Saha, Talia Konkle, George Alvarez

Via arXiv