Alert button

"Image": models, code, and papers
Alert button

Probing Multimodal Large Language Models for Global and Local Semantic Representation

Feb 27, 2024
Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng, Dongyan Zhao

Viaarxiv icon

Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction

Mar 04, 2024
Noah Maul, Annette Birkhold, Fabian Wagner, Mareike Thies, Maximilian Rohleder, Philipp Berg, Markus Kowarschik, Andreas Maier

Figure 1 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 2 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 3 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Figure 4 for Physics-Informed Learning for Time-Resolved Angiographic Contrast Agent Concentration Reconstruction
Viaarxiv icon

U-shaped Vision Mamba for Single Image Dehazing

Add code
Bookmark button
Alert button
Feb 08, 2024
Zhuoran Zheng, Chen Wu

Viaarxiv icon

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

Add code
Bookmark button
Alert button
Mar 03, 2024
Zhende Song, Chenchen Wang, Jiamu Sheng, Chi Zhang, Gang Yu, Jiayuan Fan, Tao Chen

Figure 1 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 2 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 3 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Figure 4 for MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies
Viaarxiv icon

What Is Missing in Multilingual Visual Reasoning and How to Fix It

Add code
Bookmark button
Alert button
Mar 03, 2024
Yueqi Song, Simran Khanuja, Graham Neubig

Figure 1 for What Is Missing in Multilingual Visual Reasoning and How to Fix It
Figure 2 for What Is Missing in Multilingual Visual Reasoning and How to Fix It
Figure 3 for What Is Missing in Multilingual Visual Reasoning and How to Fix It
Figure 4 for What Is Missing in Multilingual Visual Reasoning and How to Fix It
Viaarxiv icon

LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition

Add code
Bookmark button
Alert button
Mar 03, 2024
Lingfeng Liu, Dong Ni, Hangjie Yuan

Figure 1 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 2 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 3 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Figure 4 for LUM-ViT: Learnable Under-sampling Mask Vision Transformer for Bandwidth Limited Optical Signal Acquisition
Viaarxiv icon

Prompt-Driven Dynamic Object-Centric Learning for Single Domain Generalization

Feb 28, 2024
Deng Li, Aming Wu, Yaowei Wang, Yahong Han

Viaarxiv icon

Impression-CLIP: Contrastive Shape-Impression Embedding for Fonts

Feb 26, 2024
Yugo Kubota, Daichi Haraguchi, Seiichi Uchida

Viaarxiv icon

Simulation of Muon Tomography Projections to Image the Pyramids of Giza

Feb 27, 2024
Mira Liu

Viaarxiv icon

Decompose-and-Compose: A Compositional Approach to Mitigating Spurious Correlation

Feb 29, 2024
Fahimeh Hosseini Noohdani, Parsa Hosseini, Arian Yazdan Parast, Hamidreza Yaghoubi Araghi, Mahdieh Soleymani Baghshah

Viaarxiv icon