Alert button

"speech": models, code, and papers
Alert button

Evaluating Speech-in-Speech Perception via a Humanoid Robot

Dec 19, 2023
Luke Meyer, Gloria Araiza-Illan, Laura Rachman, Etienne Gaudrain, Deniz Baskent

Viaarxiv icon

Streaming Sequence Transduction through Dynamic Compression

Feb 02, 2024
Weiting Tan, Yunmo Chen, Tongfei Chen, Guanghui Qin, Haoran Xu, Heidi C. Zhang, Benjamin Van Durme, Philipp Koehn

Viaarxiv icon

Robot voice a voice controlled robot using arduino

Feb 06, 2024
Vineeth Teeda, K Sujatha, Rakesh Mutukuru

Viaarxiv icon

EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models

Dec 21, 2023
Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux

Viaarxiv icon

Progressive unsupervised domain adaptation for ASR using ensemble models and multi-stage training

Feb 07, 2024
Rehan Ahmad, Muhammad Umar Farooq, Thomas Hain

Viaarxiv icon

Self-consistent context aware conformer transducer for speech recognition

Feb 09, 2024
Konstantin Kolokolov, Pavel Pekichev, Karthik Raghunathan

Viaarxiv icon

DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation

Jan 09, 2024
Junming Chen, Yunfei Liu, Jianan Wang, Ailing Zeng, Yu Li, Qifeng Chen

Viaarxiv icon

Boosting Large Language Model for Speech Synthesis: An Empirical Study

Dec 30, 2023
Hongkun Hao, Long Zhou, Shujie Liu, Jinyu Li, Shujie Hu, Rui Wang, Furu Wei

Viaarxiv icon

OrderBkd: Textual backdoor attack through repositioning

Add code
Bookmark button
Alert button
Feb 12, 2024
Irina Alekseevskaia, Konstantin Arkhipenko

Viaarxiv icon

Using LLMs to discover emerging coded antisemitic hate-speech in extremist social media

Jan 23, 2024
Dhanush Kikkisetti, Raza Ul Mustafa, Wendy Melillo, Roberto Corizzo, Zois Boukouvalas, Jeff Gill, Nathalie Japkowicz

Viaarxiv icon