Picture for Yaowei Wang

Yaowei Wang

Revisiting Uncertainty: On Evidential Learning for Partially Relevant Video Retrieval

Add code
May 07, 2026
Viaarxiv icon

CAST: Mitigating Object Hallucination in Large Vision-Language Models via Caption-Guided Visual Attention Steering

Add code
May 06, 2026
Viaarxiv icon

Efficient Adversarial Training via Criticality-Aware Fine-Tuning

Add code
Apr 14, 2026
Viaarxiv icon

Latent-Condensed Transformer for Efficient Long Context Modeling

Add code
Apr 14, 2026
Viaarxiv icon

Imagine Before Concentration: Diffusion-Guided Registers Enhance Partially Relevant Video Retrieval

Add code
Apr 04, 2026
Viaarxiv icon

Interactive Tracking: A Human-in-the-Loop Paradigm with Memory-Augmented Adaptation

Add code
Apr 02, 2026
Viaarxiv icon

A Step Toward Federated Pretraining of Multimodal Large Language Models

Add code
Mar 25, 2026
Viaarxiv icon

Cluster-Wise Spatio-Temporal Masking for Efficient Video-Language Pretraining

Add code
Mar 24, 2026
Viaarxiv icon

From Verbatim to Gist: Distilling Pyramidal Multimodal Memory via Semantic Information Bottleneck for Long-Horizon Video Agents

Add code
Mar 02, 2026
Viaarxiv icon

EPRBench: A High-Quality Benchmark Dataset for Event Stream Based Visual Place Recognition

Add code
Feb 13, 2026
Viaarxiv icon