Shuo-yiin Chang

Google

Streaming Intended Query Detection using E2E Modeling for Continued Conversation

Aug 29, 2022

Turn-Taking Prediction for Natural Conversational Speech

Aug 29, 2022

E2E Segmenter: Joint Segmenting and Decoding for Long-Form ASR

Apr 22, 2022

Improving the fusion of acoustic and text representations in RNN-T

Jan 25, 2022

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization

Oct 21, 2020

A Streaming On-Device End-to-End Model Surpassing Server-Side Conventional Model Quality and Latency

Mar 28, 2020

Personal VAD: Speaker-Conditioned Voice Activity Detection

Aug 12, 2019

Deep Learning for Audio Signal Processing

May 25, 2019

Streaming End-to-end Speech Recognition For Mobile Devices

Nov 15, 2018