Alert button

"speech": models, code, and papers
Alert button

Pushing the Limits of ChatGPT on NLP Tasks

Add code
Bookmark button
Alert button
Jun 16, 2023
Xiaofei Sun, Linfeng Dong, Xiaoya Li, Zhen Wan, Shuhe Wang, Tianwei Zhang, Jiwei Li, Fei Cheng, Lingjuan Lyu, Fei Wu, Guoyin Wang

Figure 1 for Pushing the Limits of ChatGPT on NLP Tasks
Figure 2 for Pushing the Limits of ChatGPT on NLP Tasks
Figure 3 for Pushing the Limits of ChatGPT on NLP Tasks
Figure 4 for Pushing the Limits of ChatGPT on NLP Tasks
Viaarxiv icon

Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech

Nov 04, 2022
Xin Zhang, Iván Vallés-Pérez, Andreas Stolcke, Chengzhu Yu, Jasha Droppo, Olabanji Shonibare, Roberto Barra-Chicote, Venkatesh Ravichandran

Figure 1 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 2 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 3 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Figure 4 for Stutter-TTS: Controlled Synthesis and Improved Recognition of Stuttered Speech
Viaarxiv icon

Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports

Jun 14, 2023
Qingqing Zhu, Tejas Sudharshan Mathai, Pritam Mukherjee, Yifan Peng, Ronald M. Summers, Zhiyong Lu

Figure 1 for Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports
Figure 2 for Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports
Figure 3 for Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports
Figure 4 for Utilizing Longitudinal Chest X-Rays and Reports to Pre-Fill Radiology Reports
Viaarxiv icon

Blind Signal Dereverberation for Machine Speech Recognition

Sep 30, 2022
Samik Sadhu, Hynek Hermansky

Figure 1 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 2 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 3 for Blind Signal Dereverberation for Machine Speech Recognition
Figure 4 for Blind Signal Dereverberation for Machine Speech Recognition
Viaarxiv icon

Context-aware Fine-tuning of Self-supervised Speech Models

Add code
Bookmark button
Alert button
Dec 16, 2022
Suwon Shon, Felix Wu, Kwangyoun Kim, Prashant Sridhar, Karen Livescu, Shinji Watanabe

Figure 1 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 2 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 3 for Context-aware Fine-tuning of Self-supervised Speech Models
Figure 4 for Context-aware Fine-tuning of Self-supervised Speech Models
Viaarxiv icon

EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers

Dec 23, 2022
Ahmed Tanvir Mahdad, Cong Shi, Zhengkun Ye, Tianming Zhao, Yan Wang, Yingying Chen, Nitesh Saxena

Figure 1 for EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers
Figure 2 for EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers
Figure 3 for EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers
Figure 4 for EarSpy: Spying Caller Speech and Identity through Tiny Vibrations of Smartphone Ear Speakers
Viaarxiv icon

Human-in-the-Loop Hate Speech Classification in a Multilingual Context

Add code
Bookmark button
Alert button
Dec 05, 2022
Ana Kotarcic, Dominik Hangartner, Fabrizio Gilardi, Selina Kurer, Karsten Donnay

Figure 1 for Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Figure 2 for Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Figure 3 for Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Figure 4 for Human-in-the-Loop Hate Speech Classification in a Multilingual Context
Viaarxiv icon

Encoder-decoder multimodal speaker change detection

Jun 01, 2023
Jee-weon Jung, Soonshin Seo, Hee-Soo Heo, Geonmin Kim, You Jin Kim, Young-ki Kwon, Minjae Lee, Bong-Jin Lee

Figure 1 for Encoder-decoder multimodal speaker change detection
Figure 2 for Encoder-decoder multimodal speaker change detection
Figure 3 for Encoder-decoder multimodal speaker change detection
Figure 4 for Encoder-decoder multimodal speaker change detection
Viaarxiv icon

Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning

Add code
Bookmark button
Alert button
Jun 01, 2023
Yuting Yang, Yuke Li, Binbin Du

Figure 1 for Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Figure 2 for Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Figure 3 for Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Figure 4 for Enhancing the Unified Streaming and Non-streaming Model with Contrastive Learning
Viaarxiv icon

Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings

Jun 01, 2023
Shakeel A. Sheikh, Md Sahidullah, Fabrice Hirsch, Slim Ouni

Figure 1 for Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings
Figure 2 for Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings
Figure 3 for Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings
Figure 4 for Stuttering Detection Using Speaker Representations and Self-supervised Contextual Embeddings
Viaarxiv icon