Kai Li

Department of Computer Science and Technology, Tsinghua University, Beijing, China

When AVSR Meets Video Conferencing: Dataset, Degradation, and the Hidden Mechanism Behind Performance Collapse

Mar 24, 2026
Via arXiv

Learning Can Converge Stably to the Wrong Belief under Latent Reliability

Mar 23, 2026
Via arXiv

Enhancing Vision-Based Policies with Omni-View and Cross-Modality Knowledge Distillation for Mobile Robots

Mar 21, 2026
Via arXiv

BEAVER: A Training-Free Hierarchical Prompt Compression Method via Structure-Aware Page Selection

Mar 20, 2026
Via arXiv

PJB: A Reasoning-Aware Benchmark for Person-Job Retrieval

Mar 18, 2026
Via arXiv

ProGVC: Progressive-based Generative Video Compression via Auto-Regressive Context Modeling

Mar 18, 2026
Via arXiv

Learning Visuomotor Policy for Multi-Robot Laser Tag Game

Mar 12, 2026
Via arXiv

K^2-Agent: Co-Evolving Know-What and Know-How for Hierarchical Mobile Device Control

Feb 28, 2026
Via arXiv

StemVLA: An Open-Source Vision-Language-Action Model with Future 3D Spatial Geometry Knowledge and 4D Historical Representation

Feb 27, 2026
Via arXiv

SCOPE: Skeleton Graph-Based Computation-Efficient Framework for Autonomous UAV Exploration

Feb 26, 2026
Via arXiv