Quan Wang

Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech

Mar 21, 2022
Quan Wang, Yang Yu, Jason Pelecanos, Yiling Huang, Ignacio Lopez Moreno

Parameter-Free Attentive Scoring for Speaker Verification

Mar 10, 2022
Jason Pelecanos, Quan Wang, Yiling Huang, Ignacio Lopez Moreno

Closing the Gap between Single-User and Multi-User VoiceFilter-Lite

Feb 24, 2022
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw

CVSS Corpus and Massively Multilingual Speech-to-Speech Translation

Jan 16, 2022
Ye Jia, Michelle Tadmor Ramanovich, Quan Wang, Heiga Zen

A Conformer-based ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement and Speech Separation

Nov 18, 2021
Tom O'Malley, Arun Narayanan, Quan Wang, Alex Park, James Walker, Nathan Howard

Cross-attention conformer for context modeling in speech enhancement for ASR

Oct 30, 2021
Arun Narayanan, Chung-Cheng Chiu, Tom O'Malley, Quan Wang, Yanzhang He

Building Chinese Biomedical Language Models via Multi-Level Text Discrimination

Oct 14, 2021
Quan Wang, Songtai Dai, Benfeng Xu, Yajuan Lyu, Yong Zhu, Hua Wu, Haifeng Wang

Turn-to-Diarize: Online Speaker Diarization Constrained by Transformer Transducer Speaker Turn Detection

Oct 05, 2021
Wei Xia, Han Lu, Quan Wang, Anshuman Tripathi, Yiling Huang, Ignacio Lopez Moreno, Hasim Sak

Learning Oculomotor Behaviors from Scanpath

Aug 11, 2021
Beibin Li, Nicholas Nuechterlein, Erin Barney, Claire Foster, Minah Kim, Monique Mahony, Adham Atyabi, Li Feng, Quan Wang, Pamela Ventola, Linda Shapiro, Frederick Shic

Multi-user VoiceFilter-Lite via Attentive Speaker Embedding

Jul 02, 2021
Rajeev Rikhye, Quan Wang, Qiao Liang, Yanzhang He, Ian McGraw
