Picture for Yuexian Zou

Yuexian Zou

LAE: Language-Aware Encoder for Monolingual and Multilingual ASR

Add code
Jun 05, 2022
Figure 1 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 2 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 3 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Figure 4 for LAE: Language-Aware Encoder for Monolingual and Multilingual ASR
Viaarxiv icon

Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention

Add code
May 03, 2022
Figure 1 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 2 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 3 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Figure 4 for Improving Dual-Microphone Speech Enhancement by Learning Cross-Channel Features with Multi-Head Attention
Viaarxiv icon

End-to-end Spoken Conversational Question Answering: Task, Dataset and Model

Add code
Apr 29, 2022
Figure 1 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 2 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 3 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Figure 4 for End-to-end Spoken Conversational Question Answering: Task, Dataset and Model
Viaarxiv icon

Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction

Add code
Apr 15, 2022
Figure 1 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 2 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 3 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Figure 4 for Speaker-Aware Mixture of Mixtures Training for Weakly Supervised Speaker Extraction
Viaarxiv icon

RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection

Add code
Apr 05, 2022
Figure 1 for RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Figure 2 for RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Figure 3 for RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Figure 4 for RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection
Viaarxiv icon

A Two-student Learning Framework for Mixed Supervised Target Sound Detection

Add code
Apr 05, 2022
Figure 1 for A Two-student Learning Framework for Mixed Supervised Target Sound Detection
Figure 2 for A Two-student Learning Framework for Mixed Supervised Target Sound Detection
Figure 3 for A Two-student Learning Framework for Mixed Supervised Target Sound Detection
Figure 4 for A Two-student Learning Framework for Mixed Supervised Target Sound Detection
Viaarxiv icon

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches

Add code
Apr 04, 2022
Figure 1 for Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Figure 2 for Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Figure 3 for Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Figure 4 for Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches
Viaarxiv icon

Improving Target Sound Extraction with Timestamp Information

Add code
Apr 02, 2022
Figure 1 for Improving Target Sound Extraction with Timestamp Information
Figure 2 for Improving Target Sound Extraction with Timestamp Information
Figure 3 for Improving Target Sound Extraction with Timestamp Information
Figure 4 for Improving Target Sound Extraction with Timestamp Information
Viaarxiv icon

Integrate Lattice-Free MMI into End-to-End Speech Recognition

Add code
Apr 02, 2022
Figure 1 for Integrate Lattice-Free MMI into End-to-End Speech Recognition
Figure 2 for Integrate Lattice-Free MMI into End-to-End Speech Recognition
Figure 3 for Integrate Lattice-Free MMI into End-to-End Speech Recognition
Figure 4 for Integrate Lattice-Free MMI into End-to-End Speech Recognition
Viaarxiv icon

Learning Decoupling Features Through Orthogonality Regularization

Add code
Mar 31, 2022
Figure 1 for Learning Decoupling Features Through Orthogonality Regularization
Figure 2 for Learning Decoupling Features Through Orthogonality Regularization
Figure 3 for Learning Decoupling Features Through Orthogonality Regularization
Figure 4 for Learning Decoupling Features Through Orthogonality Regularization
Viaarxiv icon