Brian Yan

Enhancing End-to-End Conversational Speech Translation Through Target Language Context Utilization

Sep 27, 2023
Via arXiv

Incremental Blockwise Beam Search for Simultaneous Speech Translation with Controllable Quality-Latency Tradeoff

Sep 20, 2023
Via arXiv

Bayes Risk Transducer: Transducer with Controllable Alignment Prediction

Aug 19, 2023
Via arXiv

Integrating Pretrained ASR and LM to Perform Sequence Generation for Spoken Language Understanding

Jul 20, 2023
Via arXiv

Tensor decomposition for minimization of E2E SLU model toward on-device processing

Jun 02, 2023
Via arXiv

Exploration of Efficient End-to-End ASR using Discretized Input from Self-Supervised Learning

May 29, 2023
Via arXiv

Prompting the Hidden Talent of Web-Scale Speech Models for Zero-Shot Task Generalization

May 18, 2023
Via arXiv

A Comparative Study on E-Branchformer vs Conformer in Speech Recognition, Translation, and Understanding Tasks

May 18, 2023
Via arXiv

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

May 11, 2023
Via arXiv

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

May 06, 2023
Via arXiv