Alert button

"speech": models, code, and papers
Alert button

Modelling word learning and recognition using visually grounded speech

Mar 14, 2022
Danny Merkx, Sebastiaan Scholten, Stefan L. Frank, Mirjam Ernestus, Odette Scharenborg

Figure 1 for Modelling word learning and recognition using visually grounded speech
Figure 2 for Modelling word learning and recognition using visually grounded speech
Figure 3 for Modelling word learning and recognition using visually grounded speech
Figure 4 for Modelling word learning and recognition using visually grounded speech
Viaarxiv icon

Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition

Oct 01, 2022
Jash Rathod, Nauman Dawalatabad, Shatrughan Singh, Dhananjaya Gowda

Figure 1 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 2 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 3 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Figure 4 for Multi-stage Progressive Compression of Conformer Transducer for On-device Speech Recognition
Viaarxiv icon

Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics

Feb 20, 2023
Surbhi Madan, Monika Gahalawat, Tanaya Guha, Roland Goecke, Ramanathan Subramanian

Figure 1 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 2 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 3 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Figure 4 for Explainable Human-centered Traits from Head Motion and Facial Expression Dynamics
Viaarxiv icon

Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders

Add code
Bookmark button
Alert button
Oct 19, 2022
Xin Wang, Junichi Yamagishi

Figure 1 for Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders
Figure 2 for Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders
Figure 3 for Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders
Figure 4 for Spoofed training data for speech spoofing countermeasure can be efficiently created using neural vocoders
Viaarxiv icon

Language-specific Characteristic Assistance for Code-switching Speech Recognition

Add code
Bookmark button
Alert button
Jul 05, 2022
Tongtong Song, Qiang Xu, Meng Ge, Longbiao Wang, Hao Shi, Yongjie Lv, Yuqin Lin, Jianwu Dang

Figure 1 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 2 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 3 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Figure 4 for Language-specific Characteristic Assistance for Code-switching Speech Recognition
Viaarxiv icon

AISHELL-NER: Named Entity Recognition from Chinese Speech

Add code
Bookmark button
Alert button
Feb 17, 2022
Boli Chen, Guangwei Xu, Xiaobin Wang, Pengjun Xie, Meishan Zhang, Fei Huang

Figure 1 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 2 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 3 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Figure 4 for AISHELL-NER: Named Entity Recognition from Chinese Speech
Viaarxiv icon

BrainBERT: Self-supervised representation learning for intracranial recordings

Add code
Bookmark button
Alert button
Feb 28, 2023
Christopher Wang, Vighnesh Subramaniam, Adam Uri Yaari, Gabriel Kreiman, Boris Katz, Ignacio Cases, Andrei Barbu

Figure 1 for BrainBERT: Self-supervised representation learning for intracranial recordings
Figure 2 for BrainBERT: Self-supervised representation learning for intracranial recordings
Figure 3 for BrainBERT: Self-supervised representation learning for intracranial recordings
Figure 4 for BrainBERT: Self-supervised representation learning for intracranial recordings
Viaarxiv icon

Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question

Jan 04, 2022
Yuanfeng Song, Raymond Chi-Wing Wong, Xuefang Zhao, Di Jiang

Figure 1 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 2 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 3 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Figure 4 for Speech-to-SQL: Towards Speech-driven SQL Query Generation From Natural Language Question
Viaarxiv icon

Preliminary Study on SSCF-derived Polar Coordinate for ASR

Nov 30, 2022
Sotheara Leang, Eric Castelli, Dominique Vaufreydaz, Sethserey Sam

Figure 1 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 2 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 3 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Figure 4 for Preliminary Study on SSCF-derived Polar Coordinate for ASR
Viaarxiv icon

Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models

Add code
Bookmark button
Alert button
Feb 15, 2023
Abteen Ebrahimi, Arya D. McCarthy, Arturo Oncevay, Luis Chiruzzo, John E. Ortega, Gustavo A. Giménez-Lugo, Rolando Coto-Solano, Katharina Kann

Figure 1 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 2 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 3 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Figure 4 for Meeting the Needs of Low-Resource Languages: The Value of Automatic Alignments via Pretrained Models
Viaarxiv icon