Picture for Kun Shao

Kun Shao

Uncertainty-quantified Rollout Policy Adaptation for Unlabelled Cross-domain Temporal Grounding

Add code
Aug 08, 2025
Viaarxiv icon

SpatialViz-Bench: Automatically Generated Spatial Visualization Reasoning Tasks for MLLMs

Add code
Jul 10, 2025
Viaarxiv icon

AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search

Add code
Jun 06, 2025
Viaarxiv icon

ViMo: A Generative Visual GUI World Model for App Agent

Add code
Apr 15, 2025
Viaarxiv icon

VideoAgent2: Enhancing the LLM-Based Agent System for Long-Form Video Understanding by Uncertainty-Aware CoT

Add code
Apr 06, 2025
Viaarxiv icon

ThinkBench: Dynamic Out-of-Distribution Evaluation for Robust LLM Reasoning

Add code
Feb 22, 2025
Viaarxiv icon

VSC-RL: Advancing Autonomous Vision-Language Agents with Variational Subgoal-Conditioned Reinforcement Learning

Add code
Feb 11, 2025
Viaarxiv icon

AppVLM: A Lightweight Vision Language Model for Online App Control

Add code
Feb 10, 2025
Figure 1 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 2 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 3 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 4 for AppVLM: A Lightweight Vision Language Model for Online App Control
Viaarxiv icon

GUI Agents with Foundation Models: A Comprehensive Survey

Add code
Nov 07, 2024
Figure 1 for GUI Agents with Foundation Models: A Comprehensive Survey
Figure 2 for GUI Agents with Foundation Models: A Comprehensive Survey
Viaarxiv icon

Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

Add code
Nov 05, 2024
Figure 1 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 2 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 3 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Figure 4 for Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level
Viaarxiv icon