Picture for Hanyang Chen

Hanyang Chen

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

Add code
Feb 25, 2026
Viaarxiv icon

Visual Backdoor Attacks on MLLM Embodied Decision Making via Contrastive Trigger Learning

Add code
Oct 31, 2025
Viaarxiv icon

ERA: Transforming VLMs into Embodied Agents via Embodied Prior Learning and Online Reinforcement Learning

Add code
Oct 14, 2025
Viaarxiv icon

EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents

Add code
Feb 13, 2025
Figure 1 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 2 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 3 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Figure 4 for EmbodiedBench: Comprehensive Benchmarking Multi-modal Large Language Models for Vision-Driven Embodied Agents
Viaarxiv icon

DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data

Add code
Oct 31, 2024
Figure 1 for DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data
Figure 2 for DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data
Figure 3 for DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data
Figure 4 for DiffLight: A Partial Rewards Conditioned Diffusion Model for Traffic Signal Control with Missing Data
Viaarxiv icon