Alert button

"Text": models, code, and papers
Alert button

Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech

Oct 12, 2022
Byoung Jin Choi, Myeonghun Jeong, Minchan Kim, Sung Hwan Mun, Nam Soo Kim

Figure 1 for Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech
Figure 2 for Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech
Figure 3 for Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech
Figure 4 for Adversarial Speaker-Consistency Learning Using Untranscribed Speech Data for Zero-Shot Multi-Speaker Text-to-Speech
Viaarxiv icon

Ered: Enhanced Text Representations with Entities and Descriptions

Aug 18, 2022
Qinghua Zhao, Shuai Ma, Yuxuan Lei

Figure 1 for Ered: Enhanced Text Representations with Entities and Descriptions
Figure 2 for Ered: Enhanced Text Representations with Entities and Descriptions
Figure 3 for Ered: Enhanced Text Representations with Entities and Descriptions
Figure 4 for Ered: Enhanced Text Representations with Entities and Descriptions
Viaarxiv icon

Unsupervised Task Graph Generation from Instructional Video Transcripts

Feb 17, 2023
Lajanugen Logeswaran, Sungryull Sohn, Yunseok Jang, Moontae Lee, Honglak Lee

Figure 1 for Unsupervised Task Graph Generation from Instructional Video Transcripts
Figure 2 for Unsupervised Task Graph Generation from Instructional Video Transcripts
Figure 3 for Unsupervised Task Graph Generation from Instructional Video Transcripts
Figure 4 for Unsupervised Task Graph Generation from Instructional Video Transcripts
Viaarxiv icon

Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts

Nov 04, 2022
Detai Xin, Sharath Adavanne, Federico Ang, Ashish Kulkarni, Shinnosuke Takamichi, Hiroshi Saruwatari

Figure 1 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 2 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 3 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Figure 4 for Improving Speech Prosody of Audiobook Text-to-Speech Synthesis with Acoustic and Textual Contexts
Viaarxiv icon

Entity Tagging: Extracting Entities in Text Without Mention Supervision

Sep 13, 2022
Christina Du, Kashyap Popat, Louis Martin, Fabio Petroni

Figure 1 for Entity Tagging: Extracting Entities in Text Without Mention Supervision
Figure 2 for Entity Tagging: Extracting Entities in Text Without Mention Supervision
Figure 3 for Entity Tagging: Extracting Entities in Text Without Mention Supervision
Figure 4 for Entity Tagging: Extracting Entities in Text Without Mention Supervision
Viaarxiv icon

A Watermark for Large Language Models

Jan 27, 2023
John Kirchenbauer, Jonas Geiping, Yuxin Wen, Jonathan Katz, Ian Miers, Tom Goldstein

Figure 1 for A Watermark for Large Language Models
Figure 2 for A Watermark for Large Language Models
Figure 3 for A Watermark for Large Language Models
Figure 4 for A Watermark for Large Language Models
Viaarxiv icon

Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors

Feb 28, 2023
Ji Hou, Xiaoliang Dai, Zijian He, Angela Dai, Matthias Nießner

Figure 1 for Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
Figure 2 for Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
Figure 3 for Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
Figure 4 for Mask3D: Pre-training 2D Vision Transformers by Learning Masked 3D Priors
Viaarxiv icon

Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes

Mar 05, 2023
Xuan Ju, Ailing Zeng, Jianan Wang, Qiang Xu, Lei Zhang

Figure 1 for Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Figure 2 for Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Figure 3 for Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Figure 4 for Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes
Viaarxiv icon

Agile Modeling: Image Classification with Domain Experts in the Loop

Feb 25, 2023
Otilia Stretcu, Edward Vendrow, Kenji Hata, Krishnamurthy Viswanathan, Vittorio Ferrari, Sasan Tavakkol, Wenlei Zhou, Aditya Avinash, Enming Luo, Neil Gordon Alldrin, MohammadHossein Bateni, Gabriel Berger, Andrew Bunner, Chun-Ta Lu, Javier A Rey, Ariel Fuxman

Figure 1 for Agile Modeling: Image Classification with Domain Experts in the Loop
Figure 2 for Agile Modeling: Image Classification with Domain Experts in the Loop
Figure 3 for Agile Modeling: Image Classification with Domain Experts in the Loop
Figure 4 for Agile Modeling: Image Classification with Domain Experts in the Loop
Viaarxiv icon

Learning Deep Semantics for Test Completion

Mar 07, 2023
Pengyu Nie, Rahul Banerjee, Junyi Jessy Li, Raymond J. Mooney, Milos Gligoric

Figure 1 for Learning Deep Semantics for Test Completion
Figure 2 for Learning Deep Semantics for Test Completion
Figure 3 for Learning Deep Semantics for Test Completion
Figure 4 for Learning Deep Semantics for Test Completion
Viaarxiv icon