Alert button

"Image": models, code, and papers
Alert button

Enhancing Multimodal Large Language Models with Vision Detection Models: An Empirical Study

Jan 31, 2024
Qirui Jiao, Daoyuan Chen, Yilun Huang, Yaliang Li, Ying Shen

Viaarxiv icon

Data and Physics driven Deep Learning Models for Fast MRI Reconstruction: Fundamentals and Methodologies

Jan 29, 2024
Jiahao Huang, Yinzhe Wu, Fanwen Wang, Yingying Fang, Yang Nan, Cagan Alkan, Lei Xu, Zhifan Gao, Weiwen Wu, Lei Zhu, Zhaolin Chen, Peter Lally, Neal Bangerter, Kawin Setsompop, Yike Guo, Daniel Rueckert, Ge Wang, Guang Yang

Viaarxiv icon

Distributional Reduction: Unifying Dimensionality Reduction and Clustering with Gromov-Wasserstein Projection

Feb 03, 2024
Hugues Van Assel, Cédric Vincent-Cuaz, Nicolas Courty, Rémi Flamary, Pascal Frossard, Titouan Vayer

Viaarxiv icon

Tropical Decision Boundaries for Neural Networks Are Robust Against Adversarial Attacks

Feb 01, 2024
Kurt Pasque, Christopher Teska, Ruriko Yoshida, Keiji Miura, Jefferson Huang

Viaarxiv icon

High-Fidelity Diffusion-based Image Editing

Jan 04, 2024
Chen Hou, Guoqiang Wei, Zhibo Chen

Viaarxiv icon

DeSparsify: Adversarial Attack Against Token Sparsification Mechanisms in Vision Transformers

Feb 04, 2024
Oryan Yehezkel, Alon Zolfi, Amit Baras, Yuval Elovici, Asaf Shabtai

Viaarxiv icon

Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models

Add code
Bookmark button
Alert button
Feb 04, 2024
Fangzhao Zhang, Mert Pilanci

Viaarxiv icon

SCoFT: Self-Contrastive Fine-Tuning for Equitable Image Generation

Jan 16, 2024
Zhixuan Liu, Peter Schaldenbrand, Beverley-Claire Okogwu, Wenxuan Peng, Youngsik Yun, Andrew Hundt, Jihie Kim, Jean Oh

Viaarxiv icon

Towards a Flexible Scale-out Framework for Efficient Visual Data Query Processing

Feb 05, 2024
Rohit Verma, Arun Raghunath

Viaarxiv icon

Motion-Aware Video Frame Interpolation

Feb 05, 2024
Pengfei Han, Fuhua Zhang, Bin Zhao, Xuelong Li

Viaarxiv icon