Alert button

"Text": models, code, and papers
Alert button

Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences

Jul 31, 2023
Dingyi Yang, Hongyu Chen, Xinglin Hou, Tiezheng Ge, Yuning Jiang, Qin Jin

Figure 1 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 2 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 3 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Figure 4 for Visual Captioning at Will: Describing Images and Videos Guided by a Few Stylized Sentences
Viaarxiv icon

Chatbot Application to Support Smart Agriculture in Thailand

Jul 31, 2023
Paweena Suebsombut, Pradorn Sureephong, Aicha Sekhari, Suepphong Chernbumroong, Abdelaziz Bouras

Figure 1 for Chatbot Application to Support Smart Agriculture in Thailand
Figure 2 for Chatbot Application to Support Smart Agriculture in Thailand
Figure 3 for Chatbot Application to Support Smart Agriculture in Thailand
Figure 4 for Chatbot Application to Support Smart Agriculture in Thailand
Viaarxiv icon

GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions

May 24, 2023
Woojeong Jin, Subhabrata Mukherjee, Yu Cheng, Yelong Shen, Weizhu Chen, Ahmed Hassan Awadallah, Damien Jose, Xiang Ren

Figure 1 for GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Figure 2 for GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Figure 3 for GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Figure 4 for GRILL: Grounded Vision-language Pre-training via Aligning Text and Image Regions
Viaarxiv icon

S2vNTM: Semi-supervised vMF Neural Topic Modeling

Jul 06, 2023
Weijie Xu, Jay Desai, Srinivasan Sengamedu, Xiaoyu Jiang, Francis Iannacci

Figure 1 for S2vNTM: Semi-supervised vMF Neural Topic Modeling
Figure 2 for S2vNTM: Semi-supervised vMF Neural Topic Modeling
Figure 3 for S2vNTM: Semi-supervised vMF Neural Topic Modeling
Figure 4 for S2vNTM: Semi-supervised vMF Neural Topic Modeling
Viaarxiv icon

MoMo: A shared encoder Model for text, image and multi-Modal representations

Apr 11, 2023
Rakesh Chada, Zhaoheng Zheng, Pradeep Natarajan

Figure 1 for MoMo: A shared encoder Model for text, image and multi-Modal representations
Figure 2 for MoMo: A shared encoder Model for text, image and multi-Modal representations
Figure 3 for MoMo: A shared encoder Model for text, image and multi-Modal representations
Figure 4 for MoMo: A shared encoder Model for text, image and multi-Modal representations
Viaarxiv icon

Parameter-Efficient Learning for Text-to-Speech Accent Adaptation

May 18, 2023
Li-Jen Yang, Chao-Han Huck Yang, Jen-Tzung Chien

Figure 1 for Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
Figure 2 for Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
Figure 3 for Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
Figure 4 for Parameter-Efficient Learning for Text-to-Speech Accent Adaptation
Viaarxiv icon

DreamDiffusion: Generating High-Quality Images from Brain EEG Signals

Jun 29, 2023
Yunpeng Bai, Xintao Wang, Yanpei Cao, Yixiao Ge, Chun Yuan, Ying Shan

Figure 1 for DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Figure 2 for DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Figure 3 for DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Figure 4 for DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Viaarxiv icon

Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation

May 22, 2023
Ming-Hao Juan, Pu-Jen Cheng, Hui-Neng Hsu, Pin-Hsin Hsiao

Figure 1 for Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation
Figure 2 for Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation
Figure 3 for Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation
Figure 4 for Attentive Graph-based Text-aware Preference Modeling for Top-N Recommendation
Viaarxiv icon

A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text

May 05, 2023
Yunxin Li, Baotian Hu, Yuxin Ding, Lin Ma, Min Zhang

Figure 1 for A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Figure 2 for A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Figure 3 for A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Figure 4 for A Neural Divide-and-Conquer Reasoning Framework for Image Retrieval from Linguistically Complex Text
Viaarxiv icon

Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community

Jul 27, 2023
Qingyao Ai, Ting Bai, Zhao Cao, Yi Chang, Jiawei Chen, Zhumin Chen, Zhiyong Cheng, Shoubin Dong, Zhicheng Dou, Fuli Feng, Shen Gao, Jiafeng Guo, Xiangnan He, Yanyan Lan, Chenliang Li, Yiqun Liu, Ziyu Lyu, Weizhi Ma, Jun Ma, Zhaochun Ren, Pengjie Ren, Zhiqiang Wang, Mingwen Wang, Ji-Rong Wen, Le Wu, Xin Xin, Jun Xu, Dawei Yin, Peng Zhang, Fan Zhang, Weinan Zhang, Min Zhang, Xiaofei Zhu

Figure 1 for Information Retrieval Meets Large Language Models: A Strategic Report from Chinese IR Community
Viaarxiv icon