Jing Pan

Uber Technologies, San Francisco, CA, USA

WavLLM: Towards Robust and Adaptive Speech Large Language Model

Mar 31, 2024
Via arXiv

COSMIC: Data Efficient Instruction-tuning For Speech In-Context Learning

Nov 03, 2023
Via arXiv

Improving Stability in Simultaneous Speech Translation: A Revision-Controllable Decoding Approach

Oct 06, 2023
Via arXiv

E-Branchformer: Branchformer with Enhanced merging for speech recognition

Sep 30, 2022
Via arXiv

SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition

Oct 11, 2021
Via arXiv

Sensoring and Application of Multimodal Data for the Detection of Freezing of Gait in Parkinson's Disease

Oct 09, 2021
Via arXiv

Performance-Efficiency Trade-offs in Unsupervised Pre-training for Speech Recognition

Sep 14, 2021
Via arXiv

Leveraging Pre-trained Language Model for Speech Sentiment Analysis

Jun 11, 2021
Via arXiv

Multistream CNN for Robust Acoustic Modeling

May 21, 2020
Via arXiv

ASAPP-ASR: Multistream CNN and Self-Attentive SRU for SOTA Speech Recognition

May 21, 2020
Via arXiv