Alert button

"speech recognition": models, code, and papers
Alert button

An Investigation of End-to-End Models for Robust Speech Recognition

Add code
Bookmark button
Alert button
Feb 11, 2021
Archiki Prasad, Preethi Jyothi, Rajbabu Velmurugan

Figure 1 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 2 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 3 for An Investigation of End-to-End Models for Robust Speech Recognition
Figure 4 for An Investigation of End-to-End Models for Robust Speech Recognition
Viaarxiv icon

A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network

May 02, 2022
Tobias Gburrek, Christoph Boeddeker, Thilo von Neumann, Tobias Cord-Landwehr, Joerg Schmalenstroeer, Reinhold Haeb-Umbach

Figure 1 for A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
Figure 2 for A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
Figure 3 for A Meeting Transcription System for an Ad-Hoc Acoustic Sensor Network
Viaarxiv icon

Modelling word learning and recognition using visually grounded speech

Mar 14, 2022
Danny Merkx, Sebastiaan Scholten, Stefan L. Frank, Mirjam Ernestus, Odette Scharenborg

Figure 1 for Modelling word learning and recognition using visually grounded speech
Figure 2 for Modelling word learning and recognition using visually grounded speech
Figure 3 for Modelling word learning and recognition using visually grounded speech
Figure 4 for Modelling word learning and recognition using visually grounded speech
Viaarxiv icon

Hallucination of speech recognition errors with sequence to sequence learning

Mar 31, 2021
Prashant Serai, Vishal Sunder, Eric Fosler-Lussier

Figure 1 for Hallucination of speech recognition errors with sequence to sequence learning
Figure 2 for Hallucination of speech recognition errors with sequence to sequence learning
Figure 3 for Hallucination of speech recognition errors with sequence to sequence learning
Figure 4 for Hallucination of speech recognition errors with sequence to sequence learning
Viaarxiv icon

Multichannel End-to-end Speech Recognition

Mar 14, 2017
Tsubasa Ochiai, Shinji Watanabe, Takaaki Hori, John R. Hershey

Figure 1 for Multichannel End-to-end Speech Recognition
Figure 2 for Multichannel End-to-end Speech Recognition
Figure 3 for Multichannel End-to-end Speech Recognition
Figure 4 for Multichannel End-to-end Speech Recognition
Viaarxiv icon

Large Raw Emotional Dataset with Aggregation Mechanism

Add code
Bookmark button
Alert button
Dec 23, 2022
Vladimir Kondratenko, Artem Sokolov, Nikolay Karpov, Oleg Kutuzov, Nikita Savushkin, Fyodor Minkin

Figure 1 for Large Raw Emotional Dataset with Aggregation Mechanism
Figure 2 for Large Raw Emotional Dataset with Aggregation Mechanism
Figure 3 for Large Raw Emotional Dataset with Aggregation Mechanism
Figure 4 for Large Raw Emotional Dataset with Aggregation Mechanism
Viaarxiv icon

Efficient acoustic feature transformation in mismatched environments using a Guided-GAN

Add code
Bookmark button
Alert button
Oct 06, 2022
Walter Heymans, Marelie H. Davel, Charl van Heerden

Figure 1 for Efficient acoustic feature transformation in mismatched environments using a Guided-GAN
Figure 2 for Efficient acoustic feature transformation in mismatched environments using a Guided-GAN
Figure 3 for Efficient acoustic feature transformation in mismatched environments using a Guided-GAN
Figure 4 for Efficient acoustic feature transformation in mismatched environments using a Guided-GAN
Viaarxiv icon

Towards Relation Extraction From Speech

Add code
Bookmark button
Alert button
Oct 17, 2022
Tongtong Wu, Guitao Wang, Jinming Zhao, Zhaoran Liu, Guilin Qi, Yuan-Fang Li, Gholamreza Haffari

Figure 1 for Towards Relation Extraction From Speech
Figure 2 for Towards Relation Extraction From Speech
Figure 3 for Towards Relation Extraction From Speech
Figure 4 for Towards Relation Extraction From Speech
Viaarxiv icon

Speech Recognition: Keyword Spotting Through Image Recognition

Mar 10, 2018
Sanjay Krishna Gouda, Salil Kanetkar, David Harrison, Manfred K Warmuth

Figure 1 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 2 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 3 for Speech Recognition: Keyword Spotting Through Image Recognition
Figure 4 for Speech Recognition: Keyword Spotting Through Image Recognition
Viaarxiv icon

Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees

Jul 16, 2022
Michael Shoemate, Kevin Jett, Ethan Cowan, Sean Colbath, James Honaker, Prasanna Muthukumar

Figure 1 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Figure 2 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Figure 3 for Sotto Voce: Federated Speech Recognition with Differential Privacy Guarantees
Viaarxiv icon