Alert button

"Image": models, code, and papers
Alert button

UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability

Add code
Bookmark button
Alert button
Oct 28, 2023
Yanyang Guo, Fangkai Jiao, Zhiqi Shen, Liqiang Nie, Mohan Kankanhalli

Figure 1 for UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability
Figure 2 for UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability
Figure 3 for UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability
Figure 4 for UNK-VQA: A Dataset and A Probe into Multi-modal Large Models' Abstention Ability
Viaarxiv icon

Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution

Add code
Bookmark button
Alert button
Oct 06, 2023
Qingguo Liu, Pan Gao, Kang Han, Ningzhong Liu, Wei Xiang

Figure 1 for Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution
Figure 2 for Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution
Figure 3 for Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution
Figure 4 for Degradation-Aware Self-Attention Based Transformer for Blind Image Super-Resolution
Viaarxiv icon

High-resolution power equipment recognition based on improved self-attention

Nov 06, 2023
Siyi Zhang, Cheng Liu, Xiang Li, Xin Zhai, Zhen Wei, Sizhe Li, Xun Ma

Viaarxiv icon

GQKVA: Efficient Pre-training of Transformers by Grouping Queries, Keys, and Values

Nov 06, 2023
Farnoosh Javadi, Walid Ahmed, Habib Hajimolahoseini, Foozhan Ataiefard, Mohammad Hassanpour, Saina Asani, Austin Wen, Omar Mohamed Awad, Kangling Liu, Yang Liu

Viaarxiv icon

TEAL: Tokenize and Embed ALL for Multi-modal Large Language Models

Add code
Bookmark button
Alert button
Nov 08, 2023
Zhen Yang, Yingxue Zhang, Fandong Meng, Jie Zhou

Viaarxiv icon

Challenging Common Assumptions in Multi-task Learning

Add code
Bookmark button
Alert button
Nov 08, 2023
Cathrin Elich, Lukas Kirchdorfer, Jan M. Köhler, Lukas Schott

Viaarxiv icon

On Characterizing the Evolution of Embedding Space of Neural Networks using Algebraic Topology

Add code
Bookmark button
Alert button
Nov 08, 2023
Suryaka Suresh, Bishshoy Das, Vinayak Abrol, Sumantra Dutta Roy

Viaarxiv icon

PRED: Pre-training via Semantic Rendering on LiDAR Point Clouds

Add code
Bookmark button
Alert button
Nov 08, 2023
Hao Yang, Haiyang Wang, Di Dai, Liwei Wang

Viaarxiv icon

Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting

Oct 26, 2023
Benjamin Yan, Ruochen Liu, David E. Kuo, Subathra Adithan, Eduardo Pontes Reis, Stephen Kwak, Vasantha Kumar Venugopal, Chloe P. O'Connell, Agustina Saenz, Pranav Rajpurkar, Michael Moor

Figure 1 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting
Figure 2 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting
Figure 3 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting
Figure 4 for Style-Aware Radiology Report Generation with RadGraph and Few-Shot Prompting
Viaarxiv icon

SD4Match: Learning to Prompt Stable Diffusion Model for Semantic Matching

Add code
Bookmark button
Alert button
Oct 26, 2023
Xinghui Li, Jingyi Lu, Kai Han, Victor Prisacariu

Viaarxiv icon