Peidong Wang

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

Oct 06, 2023
Via arXiv

DiariST: Streaming Speech Translation with Speaker Diarization

Sep 14, 2023
Via arXiv

Building High-accuracy Multilingual ASR with Gated Language Experts and Curriculum Training

Mar 01, 2023
Via arXiv

Self-supervised learning with bi-label masked speech prediction for streaming multi-talker speech recognition

Nov 10, 2022
Via arXiv

LAMASSU: Streaming Language-Agnostic Multilingual Speech Recognition and Translation Using Neural Transducers

Nov 05, 2022
Via arXiv

A Weakly-Supervised Streaming Multilingual Speech Model with Truly Zero-Shot Capability

Nov 04, 2022
Via arXiv

Why does Self-Supervised Learning for Speech Recognition Benefit Speaker Recognition?

Apr 27, 2022
Via arXiv

Large-Scale Streaming End-to-End Speech Translation with Neural Transducers

Apr 11, 2022
Via arXiv

A Conformer Based Acoustic Model for Robust Automatic Speech Recognition

Mar 20, 2022
Via arXiv

Predicting Atlantic Multidecadal Variability

Oct 29, 2021
Via arXiv