Picture for Zhihao Wu

Zhihao Wu

Simplifying Graph Convolutional Networks with Redundancy-Free Neighbors

Add code
Apr 21, 2025
Viaarxiv icon

AppVLM: A Lightweight Vision Language Model for Online App Control

Add code
Feb 10, 2025
Figure 1 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 2 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 3 for AppVLM: A Lightweight Vision Language Model for Online App Control
Figure 4 for AppVLM: A Lightweight Vision Language Model for Online App Control
Viaarxiv icon

OCMDP: Observation-Constrained Markov Decision Process

Add code
Nov 12, 2024
Viaarxiv icon

SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation

Add code
Oct 19, 2024
Figure 1 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 2 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 3 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Figure 4 for SPA-Bench: A Comprehensive Benchmark for SmartPhone Agent Evaluation
Viaarxiv icon

DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents

Add code
Oct 18, 2024
Figure 1 for DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Figure 2 for DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Figure 3 for DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Figure 4 for DistRL: An Asynchronous Distributed Reinforcement Learning Framework for On-Device Control Agents
Viaarxiv icon

Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning

Add code
Apr 26, 2024
Figure 1 for Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning
Figure 2 for Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning
Figure 3 for Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning
Figure 4 for Masked Two-channel Decoupling Framework for Incomplete Multi-view Weak Multi-label Learning
Viaarxiv icon

ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks

Add code
Mar 14, 2024
Figure 1 for ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks
Figure 2 for ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks
Figure 3 for ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks
Figure 4 for ADEdgeDrop: Adversarial Edge Dropping for Robust Graph Neural Networks
Viaarxiv icon

OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition

Add code
Feb 29, 2024
Figure 1 for OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Figure 2 for OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Figure 3 for OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Figure 4 for OpticalDR: A Deep Optical Imaging Model for Privacy-Protective Depression Recognition
Viaarxiv icon

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Add code
Dec 22, 2023
Figure 1 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 2 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 3 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 4 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Viaarxiv icon

Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation

Add code
Dec 06, 2023
Viaarxiv icon