Alert button
Picture for Yuexian Zou

Yuexian Zou

Alert button

ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

Mar 11, 2023
Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton

Figure 1 for ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Figure 2 for ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Figure 3 for ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Figure 4 for ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation
Viaarxiv icon

Improving Weakly Supervised Sound Event Detection with Causal Intervention

Mar 10, 2023
Yifei Xin, Dongchao Yang, Fan Cui, Yujun Wang, Yuexian Zou

Figure 1 for Improving Weakly Supervised Sound Event Detection with Causal Intervention
Figure 2 for Improving Weakly Supervised Sound Event Detection with Causal Intervention
Figure 3 for Improving Weakly Supervised Sound Event Detection with Causal Intervention
Viaarxiv icon

FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning

Feb 23, 2023
Bowen Cao, Qichen Ye, Weiyuan Xu, Yuexian Zou

Figure 1 for FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning
Figure 2 for FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning
Figure 3 for FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning
Figure 4 for FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning
Viaarxiv icon

FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering

Feb 23, 2023
Qichen Ye, Bowen Cao, Nuo Chen, Weiyuan Xu, Yuexian Zou

Figure 1 for FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Figure 2 for FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Figure 3 for FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Figure 4 for FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering
Viaarxiv icon

SSVMR: Saliency-based Self-training for Video-Music Retrieval

Feb 18, 2023
Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yaowei Li, Yuexian Zou

Figure 1 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 2 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 3 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Figure 4 for SSVMR: Saliency-based Self-training for Video-Music Retrieval
Viaarxiv icon

Generating Templated Caption for Video Grounding

Jan 15, 2023
Hongxiang Li, Meng Cao, Xuxin Cheng, Zhihong Zhu, Yaowei Li, Yuexian Zou

Figure 1 for Generating Templated Caption for Video Grounding
Figure 2 for Generating Templated Caption for Video Grounding
Figure 3 for Generating Templated Caption for Video Grounding
Figure 4 for Generating Templated Caption for Video Grounding
Viaarxiv icon

Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation

Dec 24, 2022
Rongzhi Gu, Shi-Xiong Zhang, Yuexian Zou, Dong Yu

Figure 1 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 2 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 3 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Figure 4 for Towards Unified All-Neural Beamforming for Time and Frequency Domain Speech Separation
Viaarxiv icon

M3ST: Mix at Three Levels for Speech Translation

Dec 07, 2022
Xuxin Cheng, Qianqian Dong, Fengpeng Yue, Tom Ko, Mingxuan Wang, Yuexian Zou

Figure 1 for M3ST: Mix at Three Levels for Speech Translation
Figure 2 for M3ST: Mix at Three Levels for Speech Translation
Figure 3 for M3ST: Mix at Three Levels for Speech Translation
Figure 4 for M3ST: Mix at Three Levels for Speech Translation
Viaarxiv icon

Aligning Source Visual and Target Language Domains for Unpaired Video Captioning

Nov 22, 2022
Fenglin Liu, Xian Wu, Chenyu You, Shen Ge, Yuexian Zou, Xu Sun

Figure 1 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 2 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 3 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Figure 4 for Aligning Source Visual and Target Language Domains for Unpaired Video Captioning
Viaarxiv icon