Picture for Yuexian Zou

Yuexian Zou

CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning

Add code
Nov 30, 2021
Figure 1 for CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning
Figure 2 for CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning
Figure 3 for CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning
Figure 4 for CLIP Meets Video Captioners: Attribute-Aware Representation Learning Promotes Accurate Captioning
Viaarxiv icon

Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information

Add code
Oct 12, 2021
Figure 1 for Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Figure 2 for Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Figure 3 for Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Figure 4 for Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information
Viaarxiv icon

A Mutual learning framework for Few-shot Sound Event Detection

Add code
Oct 09, 2021
Figure 1 for A Mutual learning framework for Few-shot Sound Event Detection
Figure 2 for A Mutual learning framework for Few-shot Sound Event Detection
Figure 3 for A Mutual learning framework for Few-shot Sound Event Detection
Figure 4 for A Mutual learning framework for Few-shot Sound Event Detection
Viaarxiv icon

Towards Joint Intent Detection and Slot Filling via Higher-order Attention

Add code
Sep 22, 2021
Figure 1 for Towards Joint Intent Detection and Slot Filling via Higher-order Attention
Figure 2 for Towards Joint Intent Detection and Slot Filling via Higher-order Attention
Figure 3 for Towards Joint Intent Detection and Slot Filling via Higher-order Attention
Figure 4 for Towards Joint Intent Detection and Slot Filling via Higher-order Attention
Viaarxiv icon

On Pursuit of Designing Multi-modal Transformer for Video Grounding

Add code
Sep 13, 2021
Figure 1 for On Pursuit of Designing Multi-modal Transformer for Video Grounding
Figure 2 for On Pursuit of Designing Multi-modal Transformer for Video Grounding
Figure 3 for On Pursuit of Designing Multi-modal Transformer for Video Grounding
Figure 4 for On Pursuit of Designing Multi-modal Transformer for Video Grounding
Viaarxiv icon

Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering

Add code
Sep 08, 2021
Figure 1 for Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Figure 2 for Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Figure 3 for Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Figure 4 for Self-supervised Contrastive Cross-Modality Representation Learning for Spoken Question Answering
Viaarxiv icon

HAN: Higher-order Attention Network for Spoken Language Understanding

Add code
Aug 26, 2021
Figure 1 for HAN: Higher-order Attention Network for Spoken Language Understanding
Figure 2 for HAN: Higher-order Attention Network for Spoken Language Understanding
Figure 3 for HAN: Higher-order Attention Network for Spoken Language Understanding
Figure 4 for HAN: Higher-order Attention Network for Spoken Language Understanding
Viaarxiv icon

Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing

Add code
Aug 25, 2021
Figure 1 for Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing
Figure 2 for Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing
Figure 3 for Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing
Figure 4 for Fully Non-Homogeneous Atmospheric Scattering Modeling with Convolutional Neural Networks for Single Image Dehazing
Viaarxiv icon

Joint Multiple Intent Detection and Slot Filling via Self-distillation

Add code
Aug 18, 2021
Figure 1 for Joint Multiple Intent Detection and Slot Filling via Self-distillation
Figure 2 for Joint Multiple Intent Detection and Slot Filling via Self-distillation
Figure 3 for Joint Multiple Intent Detection and Slot Filling via Self-distillation
Figure 4 for Joint Multiple Intent Detection and Slot Filling via Self-distillation
Viaarxiv icon

Deep Motion Prior for Weakly-Supervised Temporal Action Localization

Add code
Aug 12, 2021
Figure 1 for Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Figure 2 for Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Figure 3 for Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Figure 4 for Deep Motion Prior for Weakly-Supervised Temporal Action Localization
Viaarxiv icon