Alert button
Picture for Yuexian Zou

Yuexian Zou

Alert button

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning

Aug 25, 2023
Bang Yang, Fenglin Liu, Xian Wu, Yaowei Wang, Xu Sun, Yuexian Zou

Figure 1 for MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Figure 2 for MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Figure 3 for MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Figure 4 for MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning
Viaarxiv icon

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

Aug 18, 2023
Hongxiang Li, Meng Cao, Xuxin Cheng, Yaowei Li, Zhihong Zhu, Yuexian Zou

Figure 1 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 2 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 3 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Figure 4 for G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory
Viaarxiv icon

Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions

Jul 28, 2023
Yifei Xin, Yuexian Zou

Figure 1 for Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Figure 2 for Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Figure 3 for Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Figure 4 for Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions
Viaarxiv icon

Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels

Jul 05, 2023
Bang Yang, Fenglin Liu, Zheng Li, Qingyu Yin, Chenyu You, Bing Yin, Yuexian Zou

Figure 1 for Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels
Figure 2 for Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels
Figure 3 for Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels
Figure 4 for Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels
Viaarxiv icon

Customizing General-Purpose Foundation Models for Medical Report Generation

Jun 09, 2023
Bang Yang, Asif Raza, Yuexian Zou, Tong Zhang

Figure 1 for Customizing General-Purpose Foundation Models for Medical Report Generation
Figure 2 for Customizing General-Purpose Foundation Models for Medical Report Generation
Figure 3 for Customizing General-Purpose Foundation Models for Medical Report Generation
Figure 4 for Customizing General-Purpose Foundation Models for Medical Report Generation
Viaarxiv icon

HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec

May 07, 2023
Dongchao Yang, Songxiang Liu, Rongjie Huang, Jinchuan Tian, Chao Weng, Yuexian Zou

Figure 1 for HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec
Figure 2 for HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec
Viaarxiv icon

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

Apr 05, 2023
Yaowei Li, Bang Yang, Xuxin Cheng, Zhihong Zhu, Hongxiang Li, Yuexian Zou

Figure 1 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 2 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 3 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Figure 4 for Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation
Viaarxiv icon