Get our free extension to see links to code for papers anywhere online!

Chrome logo Add to Chrome

Firefox logo Add to Firefox

Picture for Wei Han

Wei Han

Britton Chance Center for Biomedical Photonics, Wuhan National Laboratory for Optoelectronics-Huazhong University of Science and Technology, China

Co-training Transformer with Videos and Images Improves Action Recognition


Dec 14, 2021
Bowen Zhang, Jiahui Yu, Christopher Fifty, Wei Han, Andrew M. Dai, Ruoming Pang, Fei Sha


  Access Paper or Ask Questions

A Tensor-BTD-based Modulation for Massive Unsourced Random Access


Dec 05, 2021
Zhenting Luan, Yuchi Wu, Shansuo Liang, Liping Zhang, Wei Han, Bo Bai


  Access Paper or Ask Questions

Harmonic Retrieval with $L_1$-Tucker Tensor Decomposition


Nov 30, 2021
Zhenting Luan, Zhenyu Ming, Yuchi Wu, Wei Han, Xiang Chen, Bo Bai, Liping Zhang


  Access Paper or Ask Questions

Universal Paralinguistic Speech Representations Using Self-Supervised Conformers


Oct 09, 2021
Joel Shor, Aren Jansen, Wei Han, Daniel Park, Yu Zhang


  Access Paper or Ask Questions

Multi-trends Enhanced Dynamic Micro-video Recommendation


Oct 08, 2021
Yujie Lu, Yingxuan Huang, Shengyu Zhang, Wei Han, Hui Chen, Zhou Zhao, Fei Wu

* 11 pages, 2 figures 

  Access Paper or Ask Questions

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition


Oct 01, 2021
Yu Zhang, Daniel S. Park, Wei Han, James Qin, Anmol Gulati, Joel Shor, Aren Jansen, Yuanzhong Xu, Yanping Huang, Shibo Wang, Zongwei Zhou, Bo Li, Min Ma, William Chan, Jiahui Yu, Yongqiang Wang, Liangliang Cao, Khe Chai Sim, Bhuvana Ramabhadran, Tara N. Sainath, Françoise Beaufays, Zhifeng Chen, Quoc V. Le, Chung-Cheng Chiu, Ruoming Pang, Yonghui Wu

* 14 pages, 7 figures, 13 tables; v2: minor corrections, reference baselines and bibliography updated 

  Access Paper or Ask Questions

Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis


Sep 16, 2021
Wei Han, Hui Chen, Soujanya Poria

* Accepted as a long paper at EMNLP 2021 

  Access Paper or Ask Questions

W2v-BERT: Combining Contrastive Learning and Masked Language Modeling for Self-Supervised Speech Pre-Training


Aug 07, 2021
Yu-An Chung, Yu Zhang, Wei Han, Chung-Cheng Chiu, James Qin, Ruoming Pang, Yonghui Wu


  Access Paper or Ask Questions

Bi-Bimodal Modality Fusion for Correlation-Controlled Multimodal Sentiment Analysis


Jul 28, 2021
Wei Han, Hui Chen, Alexander Gelbukh, Amir Zadeh, Louis-philippe Morency, Soujanya Poria

* Accepted at ICMI 2021 

  Access Paper or Ask Questions

Supervised Contrastive Learning for Accented Speech Recognition


Jul 02, 2021
Tao Han, Hantao Huang, Ziang Yang, Wei Han

* Accented speech recognition, deep neural networks, model adaptation, supervised contrastive learning 

  Access Paper or Ask Questions

Bridging the gap between streaming and non-streaming ASR systems bydistilling ensembles of CTC and RNN-T models


Apr 25, 2021
Thibault Doutre, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Olivier Siohan, Liangliang Cao


  Access Paper or Ask Questions

Exploring Targeted Universal Adversarial Perturbations to End-to-end ASR Models


Apr 06, 2021
Zhiyun Lu, Wei Han, Yu Zhang, Liangliang Cao

* Submitted to INTERSPEECH 2021 

  Access Paper or Ask Questions

Improving Streaming Automatic Speech Recognition With Non-Streaming Model Distillation On Unsupervised Data


Oct 22, 2020
Thibault Doutre, Wei Han, Min Ma, Zhiyun Lu, Chung-Cheng Chiu, Ruoming Pang, Arun Narayanan, Ananya Misra, Yu Zhang, Liangliang Cao


  Access Paper or Ask Questions

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization


Oct 21, 2020
Jiahui Yu, Chung-Cheng Chiu, Bo Li, Shuo-yiin Chang, Tara N. Sainath, Yanzhang He, Arun Narayanan, Wei Han, Anmol Gulati, Yonghui Wu, Ruoming Pang

* tech report 

  Access Paper or Ask Questions

Pushing the Limits of Semi-Supervised Learning for Automatic Speech Recognition


Oct 20, 2020
Yu Zhang, James Qin, Daniel S. Park, Wei Han, Chung-Cheng Chiu, Ruoming Pang, Quoc V. Le, Yonghui Wu

* 11 pages, 3 figures, 5 tables. Submitted to NeurIPS SAS 2020 Workshop 

  Access Paper or Ask Questions

Answer-checking in Context: A Multi-modal FullyAttention Network for Visual Question Answering


Oct 17, 2020
Hantao Huang, Tao Han, Wei Han, Deep Yap, Cheng-Ming Chiang

* Accepted in ICPR2020 

  Access Paper or Ask Questions

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling


Oct 12, 2020
Jiahui Yu, Wei Han, Anmol Gulati, Chung-Cheng Chiu, Bo Li, Tara N. Sainath, Yonghui Wu, Ruoming Pang

* tech report 

  Access Paper or Ask Questions

Finding the Evidence: Localization-aware Answer Prediction for Text Visual Question Answering


Oct 06, 2020
Wei Han, Hantao Huang, Tao Han

* Accepted in COLING2020 

  Access Paper or Ask Questions

Dialogue Relation Extraction with Document-level Heterogeneous Graph Attention Networks


Sep 14, 2020
Hui Chen, Pengfei Hong, Wei Han, Navonil Majumder, Soujanya Poria


  Access Paper or Ask Questions

Improved Noisy Student Training for Automatic Speech Recognition


May 19, 2020
Daniel S. Park, Yu Zhang, Ye Jia, Wei Han, Chung-Cheng Chiu, Bo Li, Yonghui Wu, Quoc V. Le

* 5 pages, 5 figures, 4 tables. Submitted to Interspeech 2020 

  Access Paper or Ask Questions

RNN-T Models Fail to Generalize to Out-of-Domain Audio: Causes and Solutions


May 17, 2020
Chung-Cheng Chiu, Arun Narayanan, Wei Han, Rohit Prabhavalkar, Yu Zhang, Navdeep Jaitly, Ruoming Pang, Tara N. Sainath, Patrick Nguyen, Liangliang Cao, Yonghui Wu

* Submitted to Interspeech 2020 

  Access Paper or Ask Questions

Conformer: Convolution-augmented Transformer for Speech Recognition


May 16, 2020
Anmol Gulati, James Qin, Chung-Cheng Chiu, Niki Parmar, Yu Zhang, Jiahui Yu, Wei Han, Shibo Wang, Zhengdong Zhang, Yonghui Wu, Ruoming Pang

* Submitted to Interspeech 2020 

  Access Paper or Ask Questions

ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context


May 16, 2020
Wei Han, Zhengdong Zhang, Yu Zhang, Jiahui Yu, Chung-Cheng Chiu, James Qin, Anmol Gulati, Ruoming Pang, Yonghui Wu

* Submitted to Interspeech 2020 

  Access Paper or Ask Questions

Streaming Object Detection for 3-D Point Clouds


May 04, 2020
Wei Han, Zhengdong Zhang, Benjamin Caine, Brandon Yang, Christoph Sprunk, Ouais Alsharif, Jiquan Ngiam, Vijay Vasudevan, Jonathon Shlens, Zhifeng Chen


  Access Paper or Ask Questions

FFusionCGAN: An end-to-end fusion method for few-focus images using conditional GAN in cytopathological digital slides


Jan 03, 2020
Xiebo Geng, Sibo Liua, Wei Han, Xu Li, Jiabo Ma, Jingya Yu, Xiuli Liu, Sahoqun Zeng, Li Chen, Shenghua Cheng


  Access Paper or Ask Questions

Scalability in Perception for Autonomous Driving: Waymo Open Dataset


Dec 18, 2019
Pei Sun, Henrik Kretzschmar, Xerxes Dotiwalla, Aurelien Chouard, Vijaysai Patnaik, Paul Tsui, James Guo, Yin Zhou, Yuning Chai, Benjamin Caine, Vijay Vasudevan, Wei Han, Jiquan Ngiam, Hang Zhao, Aleksei Timofeev, Scott Ettinger, Maxim Krivokon, Amy Gao, Aditya Joshi, Yu Zhang, Jonathon Shlens, Zhifeng Chen, Dragomir Anguelov


  Access Paper or Ask Questions