"Image": models, code, and papers

Enhancing Adversarial Training with Prior Knowledge Distillation for Robust Image Compression

Mar 11, 2024
Cao Zhi, Bao Youneng, Meng Fanyang, Li Chao, Tan Wen, Wang Genhong, Liang Yongsheng

Via arXiv

Videoshop: Localized Semantic Video Editing with Noise-Extrapolated Diffusion Inversion

Mar 21, 2024
Xiang Fan, Anand Bhattad, Ranjay Krishna

Via arXiv

DSEG-LIME - Improving Image Explanation by Hierarchical Data-Driven Segmentation

Mar 12, 2024
Patrick Knab, Sascha Marton, Christian Bartelt

Via arXiv

Leveraging Foundation Models for Content-Based Medical Image Retrieval in Radiology

Mar 11, 2024
Stefan Denner, David Zimmerer, Dimitrios Bounias, Markus Bujotzek, Shuhan Xiao, Lisa Kausch, Philipp Schader, Tobias Penzkofer, Paul F. Jäger, Klaus Maier-Hein

Via arXiv

SAMCT: Segment Any CT Allowing Labor-Free Task-Indicator Prompts

Mar 20, 2024
Xian Lin, Yangyang Xiang, Zhehao Wang, Kwang-Ting Cheng, Zengqiang Yan, Li Yu

Via arXiv

MM1: Methods, Analysis & Insights from Multimodal LLM Pre-training

Mar 19, 2024
Brandon McKinzie, Zhe Gan, Jean-Philippe Fauconnier, Sam Dodge, Bowen Zhang, Philipp Dufter, Dhruti Shah, Xianzhi Du, Futang Peng, Floris Weers, Anton Belyi, Haotian Zhang, Karanjeet Singh, Doug Kang, Ankur Jain, Hongyu Hè, Max Schwarzer, Tom Gunter, Xiang Kong, Aonan Zhang, Jianyu Wang, Chong Wang, Nan Du, Tao Lei, Sam Wiseman, Mark Lee, Zirui Wang, Ruoming Pang, Peter Grasch, Alexander Toshev, Yinfei Yang

Via arXiv

Empowering Segmentation Ability to Multi-modal Large Language Models

Mar 21, 2024
Yuqi Yang, Peng-Tao Jiang, Jing Wang, Hao Zhang, Kai Zhao, Jinwei Chen, Bo Li

Via arXiv

Red Teaming Models for Hyperspectral Image Analysis Using Explainable AI

Mar 14, 2024
Vladimir Zaigrajew, Hubert Baniecki, Lukasz Tulczyjew, Agata M. Wijata, Jakub Nalepa, Nicolas Longépé, Przemyslaw Biecek

Via arXiv

LayoutLLM: Large Language Model Instruction Tuning for Visually Rich Document Understanding

Mar 21, 2024
Masato Fujitake

Via arXiv

StyleCineGAN: Landscape Cinemagraph Generation using a Pre-trained StyleGAN

Mar 21, 2024
Jongwoo Choi, Kwanggyoon Seo, Amirsaman Ashtari, Junyong Noh

Via arXiv