"Text": models, code, and papers
Noisy Text Data: Achilles' Heel of popular transformer based NLP models

Oct 07, 2021
Kartikay Bagla, Ankit Kumar, Shivam Gupta, Anuj Gupta

Via arXiv

Egocentric Video-Language Pretraining

Jun 03, 2022
Kevin Qinghong Lin, Alex Jinpeng Wang, Mattia Soldan, Michael Wray, Rui Yan, Eric Zhongcong Xu, Difei Gao, Rongcheng Tu, Wenzhe Zhao, Weijie Kong, Chengfei Cai, Hongfa Wang, Dima Damen, Bernard Ghanem, Wei Liu, Mike Zheng Shou

Via arXiv

PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Sep 30, 2021
Yi Ren, Jinglin Liu, Zhou Zhao

Via arXiv

LaTeRF: Label and Text Driven Object Radiance Fields

Jul 05, 2022
Ashkan Mirzaei, Yash Kant, Jonathan Kelly, Igor Gilitschenski

Via arXiv

A Dual-Attention Learning Network with Word and Sentence Embedding for Medical Visual Question Answering

Oct 01, 2022
Xiaofei Huang, Hongfang Gong

Via arXiv

TACo: Token-aware Cascade Contrastive Learning for Video-Text Alignment

Aug 23, 2021
Jianwei Yang, Yonatan Bisk, Jianfeng Gao

Via arXiv

Pretrained Models for Multilingual Federated Learning

Jun 06, 2022
Orion Weller, Marc Marone, Vladimir Braverman, Dawn Lawrie, Benjamin Van Durme

Via arXiv

Unsupervised and Distributional Detection of Machine-Generated Text

Nov 04, 2021
Matthias Gallé, Jos Rozen, Germán Kruszewski, Hady Elsahar

Via arXiv

nnSpeech: Speaker-Guided Conditional Variational Autoencoder for Zero-shot Multi-speaker Text-to-Speech

Feb 22, 2022
Botao Zhao, Xulong Zhang, Jianzong Wang, Ning Cheng, Jing Xiao

Via arXiv

Multimodal Dialog Systems with Dual Knowledge-enhanced Generative Pretrained Language Model

Jul 16, 2022
Xiaolin Chen, Xuemeng Song, Liqiang Jing, Shuo Li, Linmei Hu, Liqiang Nie

Via arXiv