Picture for Yu Qiao

Yu Qiao

ShenZhen Key Lab of Computer Vision and Pattern Recognition, SIAT-SenseTime Joint Lab, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, SIAT Branch, Shenzhen Institute of Artificial Intelligence and Robotics for Society

MM-Eureka: Exploring Visual Aha Moment with Rule-based Large-scale Reinforcement Learning

Add code
Mar 10, 2025
Viaarxiv icon

AgiBot World Colosseo: A Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems

Add code
Mar 09, 2025
Viaarxiv icon

An Egocentric Vision-Language Model based Portable Real-time Smart Assistant

Add code
Mar 06, 2025
Viaarxiv icon

Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning

Add code
Mar 02, 2025
Figure 1 for Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Figure 2 for Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Figure 3 for Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Figure 4 for Modeling Fine-Grained Hand-Object Dynamics for Egocentric Video Representation Learning
Viaarxiv icon

MMRC: A Large-Scale Benchmark for Understanding Multimodal Large Language Model in Real-World Conversation

Add code
Feb 17, 2025
Viaarxiv icon

LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement

Add code
Feb 13, 2025
Figure 1 for LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement
Figure 2 for LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement
Figure 3 for LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement
Figure 4 for LimSim Series: An Autonomous Driving Simulation Platform for Validation and Enhancement
Viaarxiv icon

Lumina-Video: Efficient and Flexible Video Generation with Multi-scale Next-DiT

Add code
Feb 10, 2025
Viaarxiv icon

Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning

Add code
Jan 25, 2025
Figure 1 for Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning
Figure 2 for Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning
Figure 3 for Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning
Figure 4 for Pre-trained Model Guided Mixture Knowledge Distillation for Adversarial Federated Learning
Viaarxiv icon

WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages

Add code
Jan 24, 2025
Figure 1 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 2 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 3 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Figure 4 for WanJuanSiLu: A High-Quality Open-Source Webtext Dataset for Low-Resource Languages
Viaarxiv icon

InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling

Add code
Jan 21, 2025
Figure 1 for InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
Figure 2 for InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
Figure 3 for InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
Figure 4 for InternVideo2.5: Empowering Video MLLMs with Long and Rich Context Modeling
Viaarxiv icon