Yuexian Zou

Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model

Jan 06, 2022
Jinchuan Tian, Jianwei Yu, Chao Weng, Yuexian Zou, Dong Yu

Consistent Training and Decoding For End-to-end Speech Recognition Using Lattice-free MMI

Dec 30, 2021
Jinchuan Tian, Jianwei Yu, Chao Weng, Shi-Xiong Zhang, Dan Su, Dong Yu, Yuexian Zou

Detect what you want: Target Sound Detection

Dec 19, 2021
Dongchao Yang, Helin Wang, Yuexian Zou, Chao Weng

CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning

Nov 30, 2021
Bang Yang, Yuexian Zou

Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information

Oct 12, 2021
Zhongjie Ye, Helin Wang, Dongchao Yang, Yuexian Zou

A Mutual learning framework for Few-shot Sound Event Detection

Oct 09, 2021
Dongchao Yang, Helin Wang, Yuexian Zou, Zhongjie Ye, Wenwu Wang

Towards Joint Intent Detection and Slot Filling via Higher-order Attention

Sep 22, 2021
Dongsheng Chen, Zhiqi Huang, Xian Wu, Shen Ge, Yuexian Zou

On Pursuit of Designing Multi-modal Transformer for Video Grounding

Sep 13, 2021
Meng Cao, Long Chen, Mike Zheng Shou, Can Zhang, Yuexian Zou

Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering

Sep 08, 2021
Chenyu You, Nuo Chen, Yuexian Zou
