Alert button

"Text": models, code, and papers
Alert button

GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text

Aug 14, 2023
Pengfei Liu, Yiming Ren, Zhixiang Ren

Figure 1 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 2 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 3 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Figure 4 for GIT-Mol: A Multi-modal Large Language Model for Molecular Science with Graph, Image, and Text
Viaarxiv icon

NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation

Sep 14, 2023
Jiaqi Zhang, Yu Cheng, Yongxin Ni, Yunzhu Pan, Zheng Yuan, Junchen Fu, Youhua Li, Jie Wang, Fajie Yuan

Figure 1 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 2 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 3 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Figure 4 for NineRec: A Benchmark Dataset Suite for Evaluating Transferable Recommendation
Viaarxiv icon

Pre-training End-to-end ASR Models with Augmented Speech Samples Queried by Text

Jul 30, 2023
Eric Sun, Jinyu Li, Jian Xue, Yifan Gong

Viaarxiv icon

Probabilistic Linguistic Knowledge and Token-level Text Augmentation

Jul 03, 2023
Zhengxiang Wang

Figure 1 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 2 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 3 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Figure 4 for Probabilistic Linguistic Knowledge and Token-level Text Augmentation
Viaarxiv icon

LLMR: Real-time Prompting of Interactive Worlds using Large Language Models

Sep 21, 2023
Fernanda De La Torre, Cathy Mengying Fang, Han Huang, Andrzej Banburski-Fahey, Judith Amores Fernandez, Jaron Lanier

Figure 1 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 2 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 3 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Figure 4 for LLMR: Real-time Prompting of Interactive Worlds using Large Language Models
Viaarxiv icon

Q-Bench: A Benchmark for General-Purpose Foundation Models on Low-level Vision

Sep 28, 2023
Haoning Wu, Zicheng Zhang, Erli Zhang, Chaofeng Chen, Liang Liao, Annan Wang, Chunyi Li, Wenxiu Sun, Qiong Yan, Guangtao Zhai, Weisi Lin

Viaarxiv icon

Reading Between the Lanes: Text VideoQA on the Road

Jul 08, 2023
George Tom, Minesh Mathew, Sergi Garcia, Dimosthenis Karatzas, C. V. Jawahar

Figure 1 for Reading Between the Lanes: Text VideoQA on the Road
Figure 2 for Reading Between the Lanes: Text VideoQA on the Road
Figure 3 for Reading Between the Lanes: Text VideoQA on the Road
Figure 4 for Reading Between the Lanes: Text VideoQA on the Road
Viaarxiv icon

Text Analysis Using Deep Neural Networks in Digital Humanities and Information Science

Jul 30, 2023
Omri Suissa, Avshalom Elmalech, Maayan Zhitomirsky-Geffet

Viaarxiv icon

Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data

Sep 27, 2023
Muyu Wang, Shiyu Fan, Yichen Li, Hui Chen

Figure 1 for Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Figure 2 for Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Figure 3 for Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Figure 4 for Missing-modality Enabled Multi-modal Fusion Architecture for Medical Data
Viaarxiv icon

VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning

Sep 27, 2023
Yanan Wang, Donghuo Zeng, Shinya Wada, Satoshi Kurihara

Figure 1 for VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Figure 2 for VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Figure 3 for VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Figure 4 for VideoAdviser: Video Knowledge Distillation for Multimodal Transfer Learning
Viaarxiv icon