"speech": models, code, and papers

Influence Scores at Scale for Efficient Language Data Sampling

Nov 27, 2023
Nikhil Anand, Joshua Tan, Maria Minakova

Via arXiv

Multi-teacher Distillation for Multilingual Spelling Correction

Nov 20, 2023
Jingfen Zhang, Xuan Guo, Sravan Bodapati, Christopher Potts

Via arXiv

Multi-Scale Sub-Band Constant-Q Transform Discriminator for High-Fidelity Vocoder

Nov 25, 2023
Yicheng Gu, Xueyao Zhang, Liumeng Xue, Zhizheng Wu

Via arXiv

Let There Be Sound: Reconstructing High Quality Speech from Silent Videos

Aug 29, 2023
Ji-Hoon Kim, Jaehun Kim, Joon Son Chung

Via arXiv

ChatGPT in the context of precision agriculture data analytics

Nov 10, 2023
Ilyas Potamitis

Via arXiv

Topological Data Mapping of Online Hate Speech, Misinformation, and General Mental Health: A Large Language Model Based Study

Sep 22, 2023
Andrew Alexander, Hongbin Wang

Via arXiv

Syn-Att: Synthetic Speech Attribution via Semi-Supervised Unknown Multi-Class Ensemble of CNNs

Sep 15, 2023
Md Awsafur Rahman, Bishmoy Paul, Najibul Haque Sarker, Zaber Ibn Abdul Hakim, Shaikh Anowarul Fattah, Mohammad Saquib

Via arXiv

NoLACE: Improving Low-Complexity Speech Codec Enhancement Through Adaptive Temporal Shaping

Sep 25, 2023
Jan Büthe, Ahmed Mustafa, Jean-Marc Valin, Karim Helwani, Michael M. Goodwin

Via arXiv

Soft Random Sampling: A Theoretical and Empirical Analysis

Nov 21, 2023
Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Via arXiv

Iterative Shallow Fusion of Backward Language Model for End-to-End Speech Recognition

Oct 17, 2023
Atsunori Ogawa, Takafumi Moriya, Naoyuki Kamo, Naohiro Tawara, Marc Delcroix

Via arXiv