Alert button

"speech": models, code, and papers
Alert button

MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023

Sep 12, 2023
Zhihang Xu, Shaofei Zhang, Xi Wang, Jiajun Zhang, Wenning Wei, Lei He, Sheng Zhao

Figure 1 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 2 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 3 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Figure 4 for MuLanTTS: The Microsoft Speech Synthesis System for Blizzard Challenge 2023
Viaarxiv icon

Exploring the Potential of Large Language Models in Computational Argumentation

Nov 15, 2023
Guizhen Chen, Liying Cheng, Luu Anh Tuan, Lidong Bing

Figure 1 for Exploring the Potential of Large Language Models in Computational Argumentation
Figure 2 for Exploring the Potential of Large Language Models in Computational Argumentation
Figure 3 for Exploring the Potential of Large Language Models in Computational Argumentation
Figure 4 for Exploring the Potential of Large Language Models in Computational Argumentation
Viaarxiv icon

DPATD: Dual-Phase Audio Transformer for Denoising

Oct 30, 2023
Junhui Li, Pu Wang, Jialu Li, Xinzhe Wang, Youshan Zhang

Viaarxiv icon

CLARA: Multilingual Contrastive Learning for Audio Representation Acquisition

Oct 18, 2023
Kari A Noriy, Xiaosong Yang, Marcin Budka, Jian Jun Zhang

Viaarxiv icon

SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis

Aug 02, 2023
Ramanan Sivaguru, Vasista Sai Lodagala, S Umesh

Figure 1 for SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis
Figure 2 for SALTTS: Leveraging Self-Supervised Speech Representations for improved Text-to-Speech Synthesis
Viaarxiv icon

SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT

Add code
Bookmark button
Alert button
Oct 16, 2023
Cheol Jun Cho, Abdelrahman Mohamed, Shang-Wen Li, Alan W Black, Gopala K. Anumanchipalli

Figure 1 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 2 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 3 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Figure 4 for SD-HuBERT: Self-Distillation Induces Syllabic Organization in HuBERT
Viaarxiv icon

How Far Can We Extract Diverse Perspectives from Large Language Models? Criteria-Based Diversity Prompting!

Nov 16, 2023
Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang

Viaarxiv icon

The Mason-Alberta Phonetic Segmenter: A forced alignment system based on deep neural networks and interpolation

Add code
Bookmark button
Alert button
Oct 24, 2023
Matthew C. Kelley, Scott James Perry, Benjamin V. Tucker

Viaarxiv icon

Bit Cipher -- A Simple yet Powerful Word Representation System that Integrates Efficiently with Language Models

Nov 18, 2023
Haoran Zhao, Jake Ryland Williams

Viaarxiv icon

Unimodal Aggregation for CTC-based Speech Recognition

Add code
Bookmark button
Alert button
Sep 15, 2023
Ying Fang, Xiaofei Li

Figure 1 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 2 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 3 for Unimodal Aggregation for CTC-based Speech Recognition
Figure 4 for Unimodal Aggregation for CTC-based Speech Recognition
Viaarxiv icon