Picture for Xiaoyu Wu

Xiaoyu Wu

MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding

Add code
Jun 10, 2025
Viaarxiv icon

Enhancing Video Memorability Prediction with Text-Motion Cross-modal Contrastive Loss and Its Application in Video Summarization

Add code
Jun 10, 2025
Viaarxiv icon

TableEval: A Real-World Benchmark for Complex, Multilingual, and Multi-Structured Table Question Answering

Add code
Jun 04, 2025
Viaarxiv icon

Breaking the Gold Standard: Extracting Forgotten Data under Exact Unlearning in Large Language Models

Add code
May 30, 2025
Viaarxiv icon

Rethinking Metrics and Benchmarks of Video Anomaly Detection

Add code
May 25, 2025
Viaarxiv icon

Language-guided Open-world Video Anomaly Detection

Add code
Mar 17, 2025
Viaarxiv icon

Winning the MIDST Challenge: New Membership Inference Attacks on Diffusion Models for Tabular Data Synthesis

Add code
Mar 15, 2025
Viaarxiv icon

PlanGen: Towards Unified Layout Planning and Image Generation in Auto-Regressive Vision Language Models

Add code
Mar 13, 2025
Viaarxiv icon

NAMI: Efficient Image Generation via Progressive Rectified Flow Transformers

Add code
Mar 12, 2025
Viaarxiv icon

PLPP: Prompt Learning with Perplexity Is Self-Distillation for Vision-Language Models

Add code
Dec 18, 2024
Viaarxiv icon