Picture for Yuhao Yang

Yuhao Yang

6DAttack: Backdoor Attacks in the 6DoF Pose Estimation

Add code
Dec 22, 2025
Figure 1 for 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation
Figure 2 for 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation
Figure 3 for 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation
Figure 4 for 6DAttack: Backdoor Attacks in the 6DoF Pose Estimation
Viaarxiv icon

See-Control: A Multimodal Agent Framework for Smartphone Interaction with a Robotic Arm

Add code
Dec 09, 2025
Viaarxiv icon

Hi-Agent: Hierarchical Vision-Language Agents for Mobile Device Control

Add code
Oct 16, 2025
Viaarxiv icon

Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents

Add code
Sep 30, 2025
Figure 1 for Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Figure 2 for Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Figure 3 for Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Figure 4 for Ferret-UI Lite: Lessons from Building Small On-Device GUI Agents
Viaarxiv icon

Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations

Add code
Mar 04, 2025
Figure 1 for Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations
Figure 2 for Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations
Figure 3 for Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations
Figure 4 for Sparse Meets Dense: Unified Generative Recommendations with Cascaded Sparse-Dense Representations
Viaarxiv icon

RecLM: Recommendation Instruction Tuning

Add code
Dec 26, 2024
Figure 1 for RecLM: Recommendation Instruction Tuning
Figure 2 for RecLM: Recommendation Instruction Tuning
Figure 3 for RecLM: Recommendation Instruction Tuning
Figure 4 for RecLM: Recommendation Instruction Tuning
Viaarxiv icon

GraphAgent: Agentic Graph Language Assistant

Add code
Dec 22, 2024
Viaarxiv icon

Aria-UI: Visual Grounding for GUI Instructions

Add code
Dec 20, 2024
Figure 1 for Aria-UI: Visual Grounding for GUI Instructions
Figure 2 for Aria-UI: Visual Grounding for GUI Instructions
Figure 3 for Aria-UI: Visual Grounding for GUI Instructions
Figure 4 for Aria-UI: Visual Grounding for GUI Instructions
Viaarxiv icon

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

Add code
Oct 19, 2024
Figure 1 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 2 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 3 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 4 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Viaarxiv icon

Learning Geospatial Region Embedding with Heterogeneous Graph

Add code
May 23, 2024
Viaarxiv icon