Zeyi Liao

Autonomous Continual Learning of Computer-Use Agents for Environment Adaptation

Feb 10, 2026
Via arXiv

SafePred: A Predictive Guardrail for Computer-Using Agents via World Models

Feb 02, 2026
Via arXiv

Graph2Eval: Automatic Multimodal Task Generation for Agents via Knowledge Graphs

Oct 01, 2025
Via arXiv

Mind2Web 2: Evaluating Agentic Search with Agent-as-a-Judge

Jun 26, 2025
Via arXiv

RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments

May 28, 2025
Via arXiv

AmpleGCG-Plus: A Strong Generative Model of Adversarial Suffixes to Jailbreak LLMs with Higher Success Rates in Fewer Attempts

Oct 29, 2024
Via arXiv

AdvWeb: Controllable Black-box Attacks on VLM-powered Web Agents

Oct 22, 2024
Via arXiv

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery

Oct 07, 2024
Via arXiv

EIA: Environmental Injection Attack on Generalist Web Agents for Privacy Leakage

Sep 17, 2024
Via arXiv

Joint Demonstration and Preference Learning Improves Policy Alignment with Human Feedback

Jun 11, 2024
Via arXiv