Alert button

"Text": models, code, and papers
Alert button

Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review

Mar 27, 2021
Jesus Perez-Martin, Benjamin Bustos, Silvio Jamil F. Guimarães, Ivan Sipiran, Jorge Pérez, Grethel Coello Said

Figure 1 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 2 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 3 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Figure 4 for Bridging Vision and Language from the Video-to-Text Perspective: A Comprehensive Review
Viaarxiv icon

Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking

Jan 07, 2021
Yingjie Gu, Xiaoye Qu, Zhefeng Wang, Baoxing Huai, Nicholas Jing Yuan, Xiaolin Gui

Figure 1 for Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking
Figure 2 for Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking
Figure 3 for Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking
Figure 4 for Read, Retrospect, Select: An MRC Framework to Short Text Entity Linking
Viaarxiv icon

Bridging the Modality Gap for Speech-to-Text Translation

Oct 28, 2020
Yuchen Liu, Junnan Zhu, Jiajun Zhang, Chengqing Zong

Figure 1 for Bridging the Modality Gap for Speech-to-Text Translation
Figure 2 for Bridging the Modality Gap for Speech-to-Text Translation
Figure 3 for Bridging the Modality Gap for Speech-to-Text Translation
Figure 4 for Bridging the Modality Gap for Speech-to-Text Translation
Viaarxiv icon

Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation

Jun 29, 2021
Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Zhen Li, Bowen Zhou, Shuguang Cui, Zhiting Hu

Figure 1 for Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation
Figure 2 for Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation
Figure 3 for Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation
Figure 4 for Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation
Viaarxiv icon

Scene Text Detection with Selected Anchor

Aug 19, 2020
Anna Zhu, Hang Du, Shengwu Xiong

Figure 1 for Scene Text Detection with Selected Anchor
Figure 2 for Scene Text Detection with Selected Anchor
Figure 3 for Scene Text Detection with Selected Anchor
Figure 4 for Scene Text Detection with Selected Anchor
Viaarxiv icon

Machine Learning-Based Analysis of Free-Text Keystroke Dynamics

Jul 01, 2021
Han-Chih Chang, Jianwei Li, Mark Stamp

Figure 1 for Machine Learning-Based Analysis of Free-Text Keystroke Dynamics
Figure 2 for Machine Learning-Based Analysis of Free-Text Keystroke Dynamics
Figure 3 for Machine Learning-Based Analysis of Free-Text Keystroke Dynamics
Figure 4 for Machine Learning-Based Analysis of Free-Text Keystroke Dynamics
Viaarxiv icon

ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation

Dec 31, 2021
Han Zhang, Weichong Yin, Yewei Fang, Lanxin Li, Boqiang Duan, Zhihua Wu, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Figure 1 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 2 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 3 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Figure 4 for ERNIE-ViLG: Unified Generative Pre-training for Bidirectional Vision-Language Generation
Viaarxiv icon

Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes

Mar 08, 2022
Hua Lu, Zhen Guo, Chanjuan Li, Yunyi Yang, Huang He, Siqi Bao

Figure 1 for Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes
Figure 2 for Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes
Figure 3 for Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes
Figure 4 for Towards Building an Open-Domain Dialogue System Incorporated with Internet Memes
Viaarxiv icon

Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning

Jun 15, 2022
Rui Liu, Berrak Sisman, Björn Schuller, Guanglai Gao, Haizhou Li

Figure 1 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 2 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 3 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Figure 4 for Accurate Emotion Strength Assessment for Seen and Unseen Speech Based on Data-Driven Deep Learning
Viaarxiv icon

Lutma: a Frame-Making Tool for Collaborative FrameNet Development

May 24, 2022
Tiago Timponi Torrent, Arthur Lorenzi, Ely Edison da Silva Matos, Frederico Belcavello, Marcelo Viridiano, Maucha Andrade Gamonal

Figure 1 for Lutma: a Frame-Making Tool for Collaborative FrameNet Development
Viaarxiv icon