Yuexian Zou

Learn Suspected Anomalies from Event Prompts for Video Anomaly Detection

Mar 02, 2024

Retrieval is Accurate Generation

Feb 29, 2024

Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

Jan 30, 2024

ML-LMCL: Mutual Learning and Large-Margin Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding

Nov 19, 2023

UnifiedVisionGPT: Streamlining Vision-Oriented AI through Generalized Multimodal Framework

Nov 16, 2023

Video Referring Expression Comprehension via Transformer with Content-conditioned Query

Oct 25, 2023

NADiffuSE: Noise-aware Diffusion-based Model for Speech Enhancement

Sep 03, 2023

MultiCapCLIP: Auto-Encoding Prompts for Zero-Shot Multilingual Visual Captioning

Aug 25, 2023

G2L: Semantically Aligned and Uniform Video Grounding via Geodesic and Game Theory

Aug 18, 2023

Improving Audio-Text Retrieval via Hierarchical Cross-Modal Interaction and Auxiliary Captions

Jul 28, 2023