Yuexian Zou

Multimodal Prompt Learning for Product Title Generation with Extremely Limited Labels

Jul 05, 2023
Via arXiv

Customizing General-Purpose Foundation Models for Medical Report Generation

Jun 09, 2023
Via arXiv

HiFi-Codec: Group-residual Vector quantization for High Fidelity Audio Codec

May 07, 2023
Via arXiv

Unify, Align and Refine: Multi-Level Semantic Alignment for Radiology Report Generation

Apr 05, 2023
Via arXiv

TLAG: An Informative Trigger and Label-Aware Knowledge Guided Model for Dialogue-based Relation Extraction

Mar 30, 2023
Via arXiv

WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research

Mar 30, 2023
Via arXiv

Improving Text-Audio Retrieval by Text-aware Attention Pooling and Prior Matrix Revised Loss

Mar 19, 2023
Via arXiv

PoseRAC: Pose Saliency Transformer for Repetitive Action Counting

Mar 16, 2023
Via arXiv

FiTs: Fine-grained Two-stage Training for Knowledge-aware Question Answering

Mar 15, 2023
Via arXiv

FTM: A Frame-level Timeline Modeling Method for Temporal Graph Representation Learning

Mar 15, 2023
Via arXiv