Wei Han

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition

Oct 20, 2020
Yu Zhang, James Qin, Daniel S. Park, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Quoc V. Le, Yonghui Wu

Via arXiv

Answer-checking in Context: A Multi-modal FullyAttention Network for Visual Question Answering

Oct 17, 2020
Hantao Huang, Tao Han, Wei Han, Deep Yap, Cheng-Ming Chiang

Via arXiv

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling

Oct 12, 2020
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang

Via arXiv

Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering

Oct 06, 2020
Wei Han, Hantao Huang, Tao Han

Via arXiv

Dialogue Relation Extraction with Document-level Heterogeneous Graph Attention Networks

Sep 14, 2020
Hui Chen, Pengfei Hong, Wei Han, Navonil Majumder, Soujanya Poria

Via arXiv

Improved Noisy Student Training for Automatic Speech Recognition

May 19, 2020
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le

Via arXiv

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions

May 17, 2020
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu

Via arXiv

Conformer: Convolution-augmented Transformer for Speech Recognition

May 16, 2020
Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang

Via arXiv

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context

May 16, 2020
Wei Han, Zhengdong Zhang, Yu Zhang, Jiahui Yu, Chung-Cheng Chiu, James Qin, Anmol Gulati, Ruoming Pang, Yonghui Wu

Via arXiv