Picture for Xuange Zhang

Xuange Zhang

SurveillanceVQA-589K: A Benchmark for Comprehensive Surveillance Video-Language Understanding with Large Models

Add code
May 19, 2025
Viaarxiv icon

HiMix: Reducing Computational Complexity in Large Vision-Language Models

Add code
Jan 17, 2025
Viaarxiv icon

UCF-Crime Annotation: A Benchmark for Surveillance Video-and-Language Understanding

Add code
Sep 25, 2023
Viaarxiv icon