Alert button
Picture for Xiaodong Cui

Xiaodong Cui

Alert button

Training Nonlinear Transformers for Efficient In-Context Learning: A Theoretical Learning and Generalization Analysis

Feb 23, 2024
Hongkang Li, Meng Wang, Songtao Lu, Xiaodong Cui, Pin-Yu Chen

Viaarxiv icon

Joint Unsupervised and Supervised Training for Automatic Speech Recognition via Bilevel Optimization

Jan 13, 2024
A F M Saif, Xiaodong Cui, Han Shen, Songtao Lu, Brian Kingsbury, Tianyi Chen

Viaarxiv icon

Soft Random Sampling: A Theoretical and Empirical Analysis

Nov 24, 2023
Xiaodong Cui, Ashish Mittal, Songtao Lu, Wei Zhang, George Saon, Brian Kingsbury

Viaarxiv icon

How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context

Aug 26, 2023
Hui Wan, Hongkang Li, Songtao Lu, Xiaodong Cui, Marina Danilevsky

Figure 1 for How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context
Figure 2 for How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context
Figure 3 for How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context
Figure 4 for How Can Context Help? Exploring Joint Retrieval of Passage and Personalized Context
Viaarxiv icon

Diagonal State Space Augmented Transformers for Speech Recognition

Feb 27, 2023
George Saon, Ankit Gupta, Xiaodong Cui

Figure 1 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 2 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 3 for Diagonal State Space Augmented Transformers for Speech Recognition
Figure 4 for Diagonal State Space Augmented Transformers for Speech Recognition
Viaarxiv icon

Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization

Jun 16, 2022
Andrea Fasoli, Chia-Yu Chen, Mauricio Serrano, Swagath Venkataramani, George Saon, Xiaodong Cui, Brian Kingsbury, Kailash Gopalakrishnan

Figure 1 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Figure 2 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Figure 3 for Accelerating Inference and Language Model Fusion of Recurrent Neural Network Transducers via End-to-End 4-bit Quantization
Viaarxiv icon

Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing

Mar 29, 2022
Xiaodong Cui, George Saon, Tohru Nagano, Masayuki Suzuki, Takashi Fukuda, Brian Kingsbury, Gakuto Kurata

Figure 1 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 2 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 3 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Figure 4 for Improving Generalization of Deep Neural Network Acoustic Models with Length Perturbation and N-best Based Label Smoothing
Viaarxiv icon

Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent

Dec 02, 2021
Wei Zhang, Mingrui Liu, Yu Feng, Xiaodong Cui, Brian Kingsbury, Yuhai Tu

Figure 1 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 2 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 3 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Figure 4 for Loss Landscape Dependent Self-Adjusting Learning Rates in Decentralized Stochastic Gradient Descent
Viaarxiv icon

Asynchronous Decentralized Distributed Training of Acoustic Models

Oct 21, 2021
Xiaodong Cui, Wei Zhang, Abdullah Kayi, Mingrui Liu, Ulrich Finkler, Brian Kingsbury, George Saon, David Kung

Figure 1 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 2 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 3 for Asynchronous Decentralized Distributed Training of Acoustic Models
Figure 4 for Asynchronous Decentralized Distributed Training of Acoustic Models
Viaarxiv icon