Alert button

"speech": models, code, and papers
Alert button

Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure

Add code
Bookmark button
Alert button
Jul 04, 2023
Yikang Wang, Hiromitsu Nishizaki, Ming Li

Figure 1 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 2 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 3 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Figure 4 for Pretraining Conformer with ASR or ASV for Anti-Spoofing Countermeasure
Viaarxiv icon

DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization

Aug 04, 2023
Wang Lu, Jindong Wang, Xinwei Sun, Yiqiang Chen, Xiangyang Ji, Qiang Yang, Xing Xie

Figure 1 for DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization
Figure 2 for DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization
Figure 3 for DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization
Figure 4 for DIVERSIFY: A General Framework for Time Series Out-of-distribution Detection and Generalization
Viaarxiv icon

DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition

Jul 06, 2023
Zhifeng Wang, Chunyan Zeng, Surong Duan, Hongjie Ouyang, Hongmin Xu

Figure 1 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 2 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 3 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Figure 4 for DSARSR: Deep Stacked Auto-encoders Enhanced Robust Speaker Recognition
Viaarxiv icon

On the Audio-visual Synchronization for Lip-to-Speech Synthesis

Add code
Bookmark button
Alert button
Mar 01, 2023
Zhe Niu, Brian Mak

Figure 1 for On the Audio-visual Synchronization for Lip-to-Speech Synthesis
Figure 2 for On the Audio-visual Synchronization for Lip-to-Speech Synthesis
Figure 3 for On the Audio-visual Synchronization for Lip-to-Speech Synthesis
Figure 4 for On the Audio-visual Synchronization for Lip-to-Speech Synthesis
Viaarxiv icon

Factual Consistency Oriented Speech Recognition

Feb 24, 2023
Naoyuki Kanda, Takuya Yoshioka, Yang Liu

Figure 1 for Factual Consistency Oriented Speech Recognition
Figure 2 for Factual Consistency Oriented Speech Recognition
Figure 3 for Factual Consistency Oriented Speech Recognition
Figure 4 for Factual Consistency Oriented Speech Recognition
Viaarxiv icon

SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains

Add code
Bookmark button
Alert button
May 22, 2023
Shuzheng Si, Wentao Ma, Yuchuan Wu, Yinpei Dai, Haoyu Gao, Ting-En Lin, Hangyu Li, Rui Yan, Fei Huang, Yongbin Li

Figure 1 for SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains
Figure 2 for SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains
Figure 3 for SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains
Figure 4 for SpokenWOZ: A Large-Scale Speech-Text Benchmark for Spoken Task-Oriented Dialogue in Multiple Domains
Viaarxiv icon

Speech Corpora Divergence Based Unsupervised Data Selection for ASR

Feb 26, 2023
Changfeng Gao, Gaofeng Cheng, Pengyuan Zhang, Yonghong Yan

Figure 1 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 2 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 3 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Figure 4 for Speech Corpora Divergence Based Unsupervised Data Selection for ASR
Viaarxiv icon

Improving Meeting Inclusiveness using Speech Interruption Analysis

Apr 02, 2023
Szu-Wei Fu, Yaran Fan, Yasaman Hosseinkashi, Jayant Gupchup, Ross Cutler

Figure 1 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 2 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 3 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Figure 4 for Improving Meeting Inclusiveness using Speech Interruption Analysis
Viaarxiv icon

CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis

Add code
Bookmark button
Alert button
Feb 28, 2023
Ji-Hoon Kim, Hong-Sun Yang, Yoon-Cheol Ju, Il-Hwan Kim, Byeong-Yeol Kim

Figure 1 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 2 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 3 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Figure 4 for CrossSpeech: Speaker-independent Acoustic Representation for Cross-lingual Speech Synthesis
Viaarxiv icon

Developmental Bootstrapping of AIs

Aug 08, 2023
Mark Stefik, Robert Price

Figure 1 for Developmental Bootstrapping of AIs
Figure 2 for Developmental Bootstrapping of AIs
Figure 3 for Developmental Bootstrapping of AIs
Figure 4 for Developmental Bootstrapping of AIs
Viaarxiv icon