Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

ShotBench: Expert-Level Cinematic Understanding in Vision-Language Models

Add code
Jun 26, 2025
Viaarxiv icon

Sekai: A Video Dataset towards World Exploration

Add code
Jun 18, 2025
Viaarxiv icon

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

Add code
Jun 12, 2025
Viaarxiv icon

Super Encoding Network: Recursive Association of Multi-Modal Encoders for Video Understanding

Add code
Jun 09, 2025
Viaarxiv icon

VideoChat-A1: Thinking with Long Videos by Chain-of-Shot Reasoning

Add code
Jun 06, 2025
Viaarxiv icon

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Add code
May 29, 2025
Viaarxiv icon

Point or Line? Using Line-based Representation for Panoptic Symbol Spotting in CAD Drawings

Add code
May 29, 2025
Viaarxiv icon

An Empirical Study of Federated Prompt Learning for Vision Language Model

Add code
May 29, 2025
Viaarxiv icon

O$^2$-Searcher: A Searching-based Agent Model for Open-Domain Open-Ended Question Answering

Add code
May 22, 2025
Viaarxiv icon

Towards Artificial General or Personalized Intelligence? A Survey on Foundation Models for Personalized Federated Intelligence

Add code
May 11, 2025
Viaarxiv icon