Alert button

"speech": models, code, and papers
Alert button

Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities

Jun 27, 2022
Andrew Catellier, Stephen Voran

Figure 1 for Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Figure 2 for Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Figure 3 for Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Figure 4 for Wideband Audio Waveform Evaluation Networks: Efficient, Accurate Estimation of Speech Qualities
Viaarxiv icon

Speech frame implementation for speech analysis and recognition

Dec 15, 2021
A. A. Konev, V. S. Khlebnikov, A. Yu. Yakimuk

Figure 1 for Speech frame implementation for speech analysis and recognition
Figure 2 for Speech frame implementation for speech analysis and recognition
Figure 3 for Speech frame implementation for speech analysis and recognition
Figure 4 for Speech frame implementation for speech analysis and recognition
Viaarxiv icon

Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition

Nov 04, 2022
Yusuke Shinohara, Shinji Watanabe

Figure 1 for Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
Figure 2 for Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
Figure 3 for Minimum Latency Training of Sequence Transducers for Streaming End-to-End Speech Recognition
Viaarxiv icon

Is Speech Pathology a Biomarker in Automatic Speaker Verification?

Apr 13, 2022
Soroosh Tayebi Arasteh, Tobias Weise, Maria Schuster, Elmar Nöth, Andreas Maier, Seung Hee Yang

Figure 1 for Is Speech Pathology a Biomarker in Automatic Speaker Verification?
Figure 2 for Is Speech Pathology a Biomarker in Automatic Speaker Verification?
Figure 3 for Is Speech Pathology a Biomarker in Automatic Speaker Verification?
Figure 4 for Is Speech Pathology a Biomarker in Automatic Speaker Verification?
Viaarxiv icon

Knowledge-Based Counterfactual Queries for Visual Question Answering

Mar 05, 2023
Theodoti Stoikou, Maria Lymperaiou, Giorgos Stamou

Figure 1 for Knowledge-Based Counterfactual Queries for Visual Question Answering
Figure 2 for Knowledge-Based Counterfactual Queries for Visual Question Answering
Figure 3 for Knowledge-Based Counterfactual Queries for Visual Question Answering
Figure 4 for Knowledge-Based Counterfactual Queries for Visual Question Answering
Viaarxiv icon

Communicating human intent to a robotic companion by multi-type gesture sentences

Mar 08, 2023
Petr Vanc, Jan Kristof Behrens, Karla Stepanova, Vaclav Hlavac

Figure 1 for Communicating human intent to a robotic companion by multi-type gesture sentences
Figure 2 for Communicating human intent to a robotic companion by multi-type gesture sentences
Figure 3 for Communicating human intent to a robotic companion by multi-type gesture sentences
Figure 4 for Communicating human intent to a robotic companion by multi-type gesture sentences
Viaarxiv icon

Multilingual Simultaneous Speech Translation

Mar 29, 2022
Shashank Subramanya, Jan Niehues

Figure 1 for Multilingual Simultaneous Speech Translation
Figure 2 for Multilingual Simultaneous Speech Translation
Figure 3 for Multilingual Simultaneous Speech Translation
Figure 4 for Multilingual Simultaneous Speech Translation
Viaarxiv icon

Efficiency 360: Efficient Vision Transformers

Feb 23, 2023
Badri N. Patro, Vijay Srinivas Agneeswaran

Figure 1 for Efficiency 360: Efficient Vision Transformers
Figure 2 for Efficiency 360: Efficient Vision Transformers
Figure 3 for Efficiency 360: Efficient Vision Transformers
Figure 4 for Efficiency 360: Efficient Vision Transformers
Viaarxiv icon

Freeform Body Motion Generation from Speech

Mar 04, 2022
Jing Xu, Wei Zhang, Yalong Bai, Qibin Sun, Tao Mei

Figure 1 for Freeform Body Motion Generation from Speech
Figure 2 for Freeform Body Motion Generation from Speech
Figure 3 for Freeform Body Motion Generation from Speech
Figure 4 for Freeform Body Motion Generation from Speech
Viaarxiv icon

TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments

Feb 14, 2023
Changye Li, Trevor Cohen, Martin Michalowski, Serguei Pakhomov

Figure 1 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 2 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 3 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Figure 4 for TRESTLE: Toolkit for Reproducible Execution of Speech, Text and Language Experiments
Viaarxiv icon