Minglun Han

ViLaS: Integrating Vision and Language into Automatic Speech Recognition

May 31, 2023
Minglun Han, Feilong Chen, Ziyi Ni, Linghui Meng, Jing Shi, Shuang Xu, Bo Xu

X-LLM: Bootstrapping Advanced Large Language Models by Treating Multi-Modalities as Foreign Languages

May 10, 2023
Feilong Chen, Minglun Han, Haozhi Zhao, Qingyang Zhang, Jing Shi, Shuang Xu, Bo Xu

Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding

Mar 02, 2023
Zefa Hu, Xiuyi Chen, Haoran Wu, Minglun Han, Ziyi Ni, Jing Shi, Shuang Xu, Bo Xu

Complex Dynamic Neurons Improved Spiking Transformer Network for Efficient Automatic Speech Recognition

Feb 02, 2023
Minglun Han, Qingyu Wang, Tielin Zhang, Yi Wang, Duzhen Zhang, Bo Xu

Knowledge Transfer from Pre-trained Language Models to CIF-based Speech Recognizers via Hierarchical Distillation

Jan 30, 2023
Minglun Han, Feilong Chen, Jing Shi, Shuang Xu, Bo Xu

VLP: A Survey on Vision-Language Pre-training

Feb 21, 2022
Feilong Chen, Duzhen Zhang, Minglun Han, Xiuyi Chen, Jing Shi, Shuang Xu, Bo Xu

Improving End-to-End Contextual Speech Recognition with Fine-grained Contextual Knowledge Selection

Jan 30, 2022
Minglun Han, Linhao Dong, Zhenlin Liang, Meng Cai, Shiyu Zhou, Zejun Ma, Bo Xu

CIF-based Collaborative Decoding for End-to-End Contextual Speech Recognition

Dec 17, 2020
Minglun Han, Linhao Dong, Shiyu Zhou, Bo Xu
