"speech": models, code, and papers

Evolution of Part-of-Speech in Classical Chinese

Sep 23, 2020
Bai Li

Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data

Oct 22, 2020
Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao

Don't shoot butterfly with rifles: Multi-channel Continuous Speech Separation with Early Exit Transformer

Oct 23, 2020
Sanyuan Chen, Yu Wu, Zhuo Chen, Takuya Yoshioka, Shujie Liu, Jinyu Li

PSO-Convolutional Neural Networks with Heterogeneous Learning Rate

May 20, 2022
Nguyen Huu Phong, Augusto Santos, Bernardete Ribeiro

FLUTE: A Scalable, Extensible Framework for High-Performance Federated Learning Simulations

Mar 25, 2022
Dimitrios Dimitriadis, Mirian Hipolito Garcia, Daniel Madrigal Diaz, Andre Manoel, Robert Sim

Analyzing the Intensity of Complaints on Social Media

Apr 20, 2022
Ming Fang, Shi Zong, Jing Li, Xinyu Dai, Shujian Huang, Jiajun Chen

Low-Latency Speaker-Independent Continuous Speech Separation

Apr 13, 2019
Takuya Yoshioka, Zhuo Chen, Changliang Liu, Xiong Xiao, Hakan Erdogan, Dimitrios Dimitriadis

Improving Voice Separation by Incorporating End-to-end Speech Recognition

Nov 29, 2019
Naoya Takahashi, Mayank Kumar Singh, Sakya Basak, Parthasaarathy Sudarsanam, Sriram Ganapathy, Yuki Mitsufuji

Dynamic Prosody Generation for Speech Synthesis using Linguistics-Driven Acoustic Embedding Selection

Dec 02, 2019
Shubhi Tyagi, Marco Nicolis, Jonas Rohnke, Thomas Drugman, Jaime Lorenzo-Trueba

SPGISpeech: 5,000 hours of transcribed financial audio for fully formatted end-to-end speech recognition

Apr 06, 2021
Patrick K. O'Neill, Vitaly Lavrukhin, Somshubra Majumdar, Vahid Noroozi, Yuekai Zhang, Oleksii Kuchaiev, Jagadeesh Balam, Yuliya Dovzhenko, Keenan Freyberg, Michael D. Shulman, Boris Ginsburg, Shinji Watanabe, Georg Kucsko
