Alert button

"Text": models, code, and papers
Alert button

Unsupervised Melody-Guided Lyrics Generation

May 12, 2023
Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Tagyoung Chung, Jing Huang, Nanyun Peng

Figure 1 for Unsupervised Melody-Guided Lyrics Generation
Figure 2 for Unsupervised Melody-Guided Lyrics Generation
Figure 3 for Unsupervised Melody-Guided Lyrics Generation
Figure 4 for Unsupervised Melody-Guided Lyrics Generation
Viaarxiv icon

ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness

May 22, 2023
Jan Cegin, Jakub Simko, Peter Brusilovsky

Figure 1 for ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
Figure 2 for ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
Figure 3 for ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
Figure 4 for ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness
Viaarxiv icon

Bridging the Granularity Gap for Acoustic Modeling

May 27, 2023
Chen Xu, Yuhao Zhang, Chengbo Jiao, Xiaoqian Liu, Chi Hu, Xin Zeng, Tong Xiao, Anxiang Ma, Huizhen Wang, JingBo Zhu

Figure 1 for Bridging the Granularity Gap for Acoustic Modeling
Figure 2 for Bridging the Granularity Gap for Acoustic Modeling
Figure 3 for Bridging the Granularity Gap for Acoustic Modeling
Figure 4 for Bridging the Granularity Gap for Acoustic Modeling
Viaarxiv icon

DocFormerv2: Local Features for Document Understanding

Jun 02, 2023
Srikar Appalaraju, Peng Tang, Qi Dong, Nishant Sankaran, Yichu Zhou, R. Manmatha

Figure 1 for DocFormerv2: Local Features for Document Understanding
Figure 2 for DocFormerv2: Local Features for Document Understanding
Figure 3 for DocFormerv2: Local Features for Document Understanding
Figure 4 for DocFormerv2: Local Features for Document Understanding
Viaarxiv icon

Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS

May 28, 2023
Sewade Ogun, Vincent Colotte, Emmanuel Vincent

Figure 1 for Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS
Figure 2 for Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS
Figure 3 for Stochastic Pitch Prediction Improves the Diversity and Naturalness of Speech in Glow-TTS
Viaarxiv icon

How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases

May 31, 2023
Aaron Mueller, Tal Linzen

Figure 1 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 2 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 3 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Figure 4 for How to Plant Trees in Language Models: Data and Architectural Effects on the Emergence of Syntactic Inductive Biases
Viaarxiv icon

Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor

May 31, 2023
Ruizhi Shao, Jingxiang Sun, Cheng Peng, Zerong Zheng, Boyao Zhou, Hongwen Zhang, Yebin Liu

Figure 1 for Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor
Figure 2 for Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor
Figure 3 for Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor
Figure 4 for Control4D: Dynamic Portrait Editing by Learning 4D GAN from 2D Diffusion-based Editor
Viaarxiv icon

A Critical Evaluation of Evaluations for Long-form Question Answering

May 29, 2023
Fangyuan Xu, Yixiao Song, Mohit Iyyer, Eunsol Choi

Figure 1 for A Critical Evaluation of Evaluations for Long-form Question Answering
Figure 2 for A Critical Evaluation of Evaluations for Long-form Question Answering
Figure 3 for A Critical Evaluation of Evaluations for Long-form Question Answering
Figure 4 for A Critical Evaluation of Evaluations for Long-form Question Answering
Viaarxiv icon

Three Towers: Flexible Contrastive Learning with Pretrained Image Models

May 29, 2023
Jannik Kossen, Mark Collier, Basil Mustafa, Xiao Wang, Xiaohua Zhai, Lucas Beyer, Andreas Steiner, Jesse Berent, Rodolphe Jenatton, Efi Kokiopoulou

Figure 1 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 2 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 3 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Figure 4 for Three Towers: Flexible Contrastive Learning with Pretrained Image Models
Viaarxiv icon

ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation

May 29, 2023
Ambuj Mehrish, Abhinav Ramesh Kashyap, Li Yingting, Navonil Majumder, Soujanya Poria

Figure 1 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 2 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 3 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Figure 4 for ADAPTERMIX: Exploring the Efficacy of Mixture of Adapters for Low-Resource TTS Adaptation
Viaarxiv icon