Alert button

"Text": models, code, and papers
Alert button

Investigating the Catastrophic Forgetting in Multimodal Large Language Models

Sep 26, 2023
Yuexiang Zhai, Shengbang Tong, Xiao Li, Mu Cai, Qing Qu, Yong Jae Lee, Yi Ma

Figure 1 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 2 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 3 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Figure 4 for Investigating the Catastrophic Forgetting in Multimodal Large Language Models
Viaarxiv icon

Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks

Sep 29, 2023
Vaidehi Patil, Peter Hase, Mohit Bansal

Figure 1 for Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
Figure 2 for Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
Figure 3 for Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
Figure 4 for Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks
Viaarxiv icon

Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection

Sep 23, 2023
Qingkun Deng, Saturnino Luz, Sofia de la Fuente Garcia

Figure 1 for Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection
Figure 2 for Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection
Figure 3 for Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection
Figure 4 for Hierarchical attention interpretation: an interpretable speech-level transformer for bi-modal depression detection
Viaarxiv icon

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

Aug 14, 2023
Pengfei Liu, Yiming Ren, Zhixiang Ren

Figure 1 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 2 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 3 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 4 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Viaarxiv icon

Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

Jul 30, 2023
Eric Sun, Jinyu Li, Jian Xue, Yifan Gong

Viaarxiv icon

Probabilistic Linguistic Knowledge and Token-level Text Augmentation

Jul 03, 2023
Zhengxiang Wang

Figure 1 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 2 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 3 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 4 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Viaarxiv icon

NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation

Sep 14, 2023
Jiaqi Zhang, Yu Cheng, Yongxin Ni, Yunzhu Pan, Zheng Yuan, Junchen Fu, Youhua Li, Jie Wang, Fajie Yuan

Figure 1 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 2 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 3 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 4 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Viaarxiv icon

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models

Sep 21, 2023
Fernanda De La Torre, Cathy Mengying Fang, Han Huang, Andrzej Banburski-Fahey, Judith Amores Fernandez, Jaron Lanier

Figure 1 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 2 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 3 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 4 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Jul 08, 2023
George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Reading Between the Lanes: Text VideoQA on the Road
Figure 2 for Reading Between the Lanes: Text VideoQA on the Road
Figure 3 for Reading Between the Lanes: Text VideoQA on the Road
Figure 4 for Reading Between the Lanes: Text VideoQA on the Road
Viaarxiv icon

Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision

Sep 28, 2023
Haoning Wu, Zicheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Chunyi Li, Wenxiu Sun, Qiong Yan, Guangtao Zhai, Weisi Lin

Viaarxiv icon