Alert button

"speech": models, code, and papers
Alert button

Analyzing the Representational Geometry of Acoustic Word Embeddings

Jan 08, 2023
Badr M. Abdullah, Dietrich Klakow

Figure 1 for Analyzing the Representational Geometry of Acoustic Word Embeddings
Figure 2 for Analyzing the Representational Geometry of Acoustic Word Embeddings
Figure 3 for Analyzing the Representational Geometry of Acoustic Word Embeddings
Figure 4 for Analyzing the Representational Geometry of Acoustic Word Embeddings
Viaarxiv icon

Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement

Add code
Bookmark button
Alert button
Mar 22, 2022
Haoyu Li, Yun Liu, Junichi Yamagishi

Figure 1 for Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Figure 2 for Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Figure 3 for Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Figure 4 for Joint Noise Reduction and Listening Enhancement for Full-End Speech Enhancement
Viaarxiv icon

AISHELL-NER: Named Entity Recognition from Chinese Speech

Add code
Bookmark button
Alert button
Feb 17, 2022
Boli Chen, Guangwei Xu, Xiaobin Wang, Pengjun Xie, Meishan Zhang, Fei Huang

Figure 1 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 2 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 3 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 4 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Viaarxiv icon

Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech

May 10, 2022
Ilya Sklyar, Anna Piunova, Christian Osendorfer

Figure 1 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 2 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 3 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Figure 4 for Separator-Transducer-Segmenter: Streaming Recognition and Segmentation of Multi-party Speech
Viaarxiv icon

Characterizing Financial Market Coverage using Artificial Intelligence

Add code
Bookmark button
Alert button
Feb 07, 2023
Jean Marie Tshimula, D'Jeff K. Nkashama, Patrick Owusu, Marc Frappier, Pierre-Martin Tardif, Froduald Kabanza, Armelle Brun, Jean-Marc Patenaude, Shengrui Wang, Belkacem Chikhaoui

Figure 1 for Characterizing Financial Market Coverage using Artificial Intelligence
Figure 2 for Characterizing Financial Market Coverage using Artificial Intelligence
Figure 3 for Characterizing Financial Market Coverage using Artificial Intelligence
Figure 4 for Characterizing Financial Market Coverage using Artificial Intelligence
Viaarxiv icon

Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question

Jan 04, 2022
Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang

Figure 1 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 2 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 3 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 4 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Viaarxiv icon

ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech

Add code
Bookmark button
Alert button
Jul 13, 2022
Rongjie Huang, Zhou Zhao, Huadai Liu, Jinglin Liu, Chenye Cui, Yi Ren

Figure 1 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 2 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 3 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Figure 4 for ProDiff: Progressive Fast Diffusion Model For High-Quality Text-to-Speech
Viaarxiv icon

Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring

Aug 26, 2022
Nelly Elsayed, Zag ElSayed, Navid Asadizanjani, Murat Ozer, Ahmed Abdelgawad, Magdy Bayoumi

Figure 1 for Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring
Figure 2 for Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring
Figure 3 for Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring
Figure 4 for Speech Emotion Recognition using Supervised Deep Recurrent System for Mental Health Monitoring
Viaarxiv icon

Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks

Add code
Bookmark button
Alert button
Feb 11, 2023
Daniel Kang, Xuechen Li, Ion Stoica, Carlos Guestrin, Matei Zaharia, Tatsunori Hashimoto

Figure 1 for Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Figure 2 for Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Figure 3 for Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Figure 4 for Exploiting Programmatic Behavior of LLMs: Dual-Use Through Standard Security Attacks
Viaarxiv icon

Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows

Nov 10, 2022
Abdelhamid Ezzerg, Thomas Merritt, Kayoko Yanagisawa, Piotr Bilinski, Magdalena Proszewska, Kamil Pokora, Renard Korzeniowski, Roberto Barra-Chicote, Daniel Korzekwa

Figure 1 for Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows
Figure 2 for Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows
Figure 3 for Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows
Figure 4 for Remap, warp and attend: Non-parallel many-to-many accent conversion with Normalizing Flows
Viaarxiv icon