Alert button

"speech": models, code, and papers
Alert button

Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews

Aug 19, 2019
Michael Gref, Christoph Schmidt, Sven Behnke, Joachim Köhler

Figure 1 for Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews
Figure 2 for Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews
Figure 3 for Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews
Figure 4 for Two-Staged Acoustic Modeling Adaption for Robust Speech Recognition by the Example of German Oral History Interviews
Viaarxiv icon

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Add code
Bookmark button
Alert button
Feb 24, 2022
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno

Figure 1 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 2 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 3 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Figure 4 for Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
Viaarxiv icon

A small Griko-Italian speech translation corpus

Jul 27, 2018
Marcely Zanon Boito, Antonios Anastasopoulos, Marika Lekakou, Aline Villavicencio, Laurent Besacier

Figure 1 for A small Griko-Italian speech translation corpus
Figure 2 for A small Griko-Italian speech translation corpus
Figure 3 for A small Griko-Italian speech translation corpus
Figure 4 for A small Griko-Italian speech translation corpus
Viaarxiv icon

Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images

Sep 15, 2018
Jonghye Woo, Fangxu Xing, Maureen Stone, Jordan Green, Timothy G. Reese, Thomas J. Brady, Van J. Wedeen, Jerry L. Prince, Georges El Fakhri

Figure 1 for Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images
Figure 2 for Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images
Figure 3 for Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images
Figure 4 for Speech Map: A Statistical Multimodal Atlas of 4D Tongue Motion During Speech from Tagged and Cine MR Images
Viaarxiv icon

Right-wing German Hate Speech on Twitter: Analysis and Automatic Detection

Oct 16, 2019
Sylvia Jaki, Tom De Smedt

Figure 1 for Right-wing German Hate Speech on Twitter: Analysis and Automatic Detection
Figure 2 for Right-wing German Hate Speech on Twitter: Analysis and Automatic Detection
Figure 3 for Right-wing German Hate Speech on Twitter: Analysis and Automatic Detection
Figure 4 for Right-wing German Hate Speech on Twitter: Analysis and Automatic Detection
Viaarxiv icon

Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding

Apr 01, 2022
Xuandi Fu, Feng-Ju Chang, Martin Radfar, Kai Wei, Jing Liu, Grant P. Strimel, Kanthashree Mysore Sathyendra

Figure 1 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 2 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 3 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Figure 4 for Multi-task RNN-T with Semantic Decoder for Streamable Spoken Language Understanding
Viaarxiv icon

Leveraging End-to-End Speech Recognition with Neural Architecture Search

Dec 11, 2019
Ahmed Baruwa, Mojeed Abisiga, Ibrahim Gbadegesin, Afeez Fakunle

Figure 1 for Leveraging End-to-End Speech Recognition with Neural Architecture Search
Figure 2 for Leveraging End-to-End Speech Recognition with Neural Architecture Search
Figure 3 for Leveraging End-to-End Speech Recognition with Neural Architecture Search
Figure 4 for Leveraging End-to-End Speech Recognition with Neural Architecture Search
Viaarxiv icon

Quantitative phase and absorption contrast imaging

Mar 23, 2022
Miguel Moscoso, Alexei Novikov, George Papanicolaou, Chrysoula Tsogka

Figure 1 for Quantitative phase and absorption contrast imaging
Figure 2 for Quantitative phase and absorption contrast imaging
Figure 3 for Quantitative phase and absorption contrast imaging
Figure 4 for Quantitative phase and absorption contrast imaging
Viaarxiv icon

MeetDot: Videoconferencing with Live Translation Captions

Add code
Bookmark button
Alert button
Sep 20, 2021
Arkady Arkhangorodsky, Christopher Chu, Scot Fang, Yiqi Huang, Denglin Jiang, Ajay Nagesh, Boliang Zhang, Kevin Knight

Figure 1 for MeetDot: Videoconferencing with Live Translation Captions
Figure 2 for MeetDot: Videoconferencing with Live Translation Captions
Figure 3 for MeetDot: Videoconferencing with Live Translation Captions
Figure 4 for MeetDot: Videoconferencing with Live Translation Captions
Viaarxiv icon

A Context-Aware Feature Fusion Framework for Punctuation Restoration

Add code
Bookmark button
Alert button
Mar 23, 2022
Yangjun Wu, Kebin Fang, Yao Zhao

Figure 1 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 2 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 3 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Figure 4 for A Context-Aware Feature Fusion Framework for Punctuation Restoration
Viaarxiv icon