Alert button
Picture for Keyu An

Keyu An

Alert button

Advancing VAD Systems Based on Multi-Task Learning with Improved Model Structures

Add code
Bookmark button
Alert button
Dec 19, 2023
Lingyun Zuo, Keyu An, Shiliang Zhang, Zhijie Yan

Viaarxiv icon

Exploring RWKV for Memory Efficient and Low Latency Streaming ASR

Add code
Bookmark button
Alert button
Sep 26, 2023
Keyu An, Shiliang Zhang

Viaarxiv icon

BAT: Boundary aware transducer for memory-efficient and low-latency ASR

Add code
Bookmark button
Alert button
May 19, 2023
Keyu An, Xian Shi, Shiliang Zhang

Figure 1 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 2 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 3 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Figure 4 for BAT: Boundary aware transducer for memory-efficient and low-latency ASR
Viaarxiv icon

An Empirical Study of Language Model Integration for Transducer based Speech Recognition

Add code
Bookmark button
Alert button
Mar 31, 2022
Huahuan Zheng, Keyu An, Zhijian Ou, Chen Huang, Ke Ding, Guanglu Wan

Figure 1 for An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Figure 2 for An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Figure 3 for An Empirical Study of Language Model Integration for Transducer based Speech Recognition
Viaarxiv icon

CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR

Add code
Bookmark button
Alert button
Mar 31, 2022
Keyu An, Huahuan Zheng, Zhijian Ou, Hongyu Xiang, Ke Ding, Guanglu Wan

Figure 1 for CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Figure 2 for CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Figure 3 for CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Figure 4 for CUSIDE: Chunking, Simulating Future Context and Decoding for Streaming ASR
Viaarxiv icon

Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study

Add code
Bookmark button
Alert button
Mar 31, 2022
Keyu An, Zhijian Ou

Figure 1 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 2 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 3 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Figure 4 for Exploiting Single-Channel Speech for Multi-Channel End-to-End Speech Recognition: A Comparative Study
Viaarxiv icon

Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings

Add code
Bookmark button
Alert button
Jul 11, 2021
Chengrui Zhu, Keyu An, Huahuan Zheng, Zhijian Ou

Figure 1 for Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Figure 2 for Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Figure 3 for Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Figure 4 for Multilingual and crosslingual speech recognition using phonological-vector based phone embeddings
Viaarxiv icon

Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition

Add code
Bookmark button
Alert button
Jul 06, 2021
Keyu An, Zhijian Ou

Figure 1 for Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition
Figure 2 for Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition
Figure 3 for Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition
Figure 4 for Exploiting Single-Channel Speech For Multi-channel End-to-end Speech Recognition
Viaarxiv icon

Deformable TDNN with adaptive receptive fields for speech recognition

Add code
Bookmark button
Alert button
Apr 30, 2021
Keyu An, Yi Zhang, Zhijian Ou

Figure 1 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 2 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 3 for Deformable TDNN with adaptive receptive fields for speech recognition
Figure 4 for Deformable TDNN with adaptive receptive fields for speech recognition
Viaarxiv icon

Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients

Add code
Bookmark button
Alert button
Nov 11, 2020
Huahuan Zheng, Keyu An, Zhijian Ou

Figure 1 for Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients
Figure 2 for Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients
Figure 3 for Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients
Figure 4 for Efficient Neural Architecture Search for End-to-end Speech Recognition via Straight-Through Gradients
Viaarxiv icon