Alert button

"speech": models, code, and papers
Alert button

Humane Speech Synthesis through Zero-Shot Emotion and Disfluency Generation

Add code
Bookmark button
Alert button
Mar 31, 2024
Rohan Chaudhury, Mihir Godbole, Aakash Garg, Jinsil Hwaryoung Seo

Viaarxiv icon

Removing Speaker Information from Speech Representation using Variable-Length Soft Pooling

Apr 01, 2024
Injune Hwang, Kyogu Lee

Viaarxiv icon

ART: The Alternating Reading Task Corpus for Speech Entrainment and Imitation

Apr 03, 2024
Zheng Yuan, Dorina de Jong, Štefan Beňuš, Noël Nguyen, Ruitao Feng, Róbert Sabo, Luciano Fadiga, Alessandro D`Ausilio

Viaarxiv icon

WavLLM: Towards Robust and Adaptive Speech Large Language Model

Mar 31, 2024
Shujie Hu, Long Zhou, Shujie Liu, Sanyuan Chen, Hongkun Hao, Jing Pan, Xunying Liu, Jinyu Li, Sunit Sivasankaran, Linquan Liu, Furu Wei

Viaarxiv icon

RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis

Apr 06, 2024
Detai Xin, Xu Tan, Kai Shen, Zeqian Ju, Dongchao Yang, Yuancheng Wang, Shinnosuke Takamichi, Hiroshi Saruwatari, Shujie Liu, Jinyu Li, Sheng Zhao

Viaarxiv icon

Llama-VITS: Enhancing TTS Synthesis with Semantic Awareness

Apr 12, 2024
Xincan Feng, Akifumi Yoshimoto

Viaarxiv icon

VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild

Add code
Bookmark button
Alert button
Mar 25, 2024
Puyuan Peng, Po-Yao Huang, Daniel Li, Abdelrahman Mohamed, David Harwath

Viaarxiv icon

Towards Variable and Coordinated Holistic Co-Speech Motion Generation

Add code
Bookmark button
Alert button
Mar 30, 2024
Yifei Liu, Qiong Cao, Yandong Wen, Huaiguang Jiang, Changxing Ding

Viaarxiv icon

THQA: A Perceptual Quality Assessment Database for Talking Heads

Add code
Bookmark button
Alert button
Apr 13, 2024
Yingjie Zhou, Zicheng Zhang, Wei Sun, Xiaohong Liu, Xiongkuo Min, Zhihua Wang, Xiao-Ping Zhang, Guangtao Zhai

Viaarxiv icon

Linguistic Changes in Spontaneous Speech for Detecting Parkinsons Disease Using Large Language Models

Apr 08, 2024
Jonathan Crawford

Viaarxiv icon