Alert button

"speech": models, code, and papers
Alert button

Identifying Planetary Names in Astronomy Papers: A Multi-Step Approach

Dec 17, 2023
Golnaz Shapurian, Michael J Kurtz, Alberto Accomazzi

Viaarxiv icon

Do self-supervised speech and language models extract similar representations as human brain?

Oct 07, 2023
Peili Chen, Linyang He, Li Fu, Lu Fan, Edward F. Chang, Yuanning Li

Viaarxiv icon

Modular Customizable ROS-Based Framework for Rapid Development of Social Robots

Nov 27, 2023
Mahta Akhyani, Hadi Moradi

Viaarxiv icon

JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions

Add code
Bookmark button
Alert button
Oct 09, 2023
Detai Xin, Junfeng Jiang, Shinnosuke Takamichi, Yuki Saito, Akiko Aizawa, Hiroshi Saruwatari

Figure 1 for JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Figure 2 for JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Figure 3 for JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Figure 4 for JVNV: A Corpus of Japanese Emotional Speech with Verbal Content and Nonverbal Expressions
Viaarxiv icon

Adaptive Computation Modules: Granular Conditional Computation For Efficient Inference

Dec 15, 2023
Bartosz Wójcik, Alessio Devoto, Karol Pustelnik, Pasquale Minervini, Simone Scardapane

Viaarxiv icon

Subspace Hybrid MVDR Beamforming for Augmented Hearing

Nov 30, 2023
Sina Hafezi, Alastair H. Moore, Pierre H. Guiraud, Patrick A. Naylor, Jacob Donley, Vladimir Tourbabin, Thomas Lunner

Viaarxiv icon

Exploiting Symmetric Temporally Sparse BPTT for Efficient RNN Training

Dec 14, 2023
Xi Chen, Chang Gao, Zuowen Wang, Longbiao Cheng, Sheng Zhou, Shih-Chii Liu, Tobi Delbruck

Viaarxiv icon

AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition

Add code
Bookmark button
Alert button
Sep 29, 2023
Andrew Rouditchenko, Ronan Collobert, Tatiana Likhomanenko

Figure 1 for AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Figure 2 for AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Figure 3 for AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Figure 4 for AV-CPL: Continuous Pseudo-Labeling for Audio-Visual Speech Recognition
Viaarxiv icon

CorrTalk: Correlation Between Hierarchical Speech and Facial Activity Variances for 3D Animation

Add code
Bookmark button
Alert button
Oct 17, 2023
Zhaojie Chu, Kailing Guo, Xiaofen Xing, Yilin Lan, Bolun Cai, Xiangmin Xu

Viaarxiv icon

Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition

Add code
Bookmark button
Alert button
Oct 16, 2023
Srijith Radhakrishnan, Chao-Han Huck Yang, Sumeer Ahmad Khan, Rohit Kumar, Narsis A. Kiani, David Gomez-Cabrero, Jesper N. Tegner

Figure 1 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 2 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 3 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Figure 4 for Whispering LLaMA: A Cross-Modal Generative Error Correction Framework for Speech Recognition
Viaarxiv icon