Chaoran Chen

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

Jun 04, 2026
Via arXiv

How Coding Agents Fail Their Users: A Large-Scale Analysis of Developer-Agent Misalignment in 20,574 Real-World Sessions

May 28, 2026
Via arXiv

Behavioral Canaries: Auditing Private Retrieved Context Usage in RL Fine-Tuning

Apr 24, 2026
Via arXiv

Toward a Human-Centered Evaluation Framework for Trustworthy LLM-Powered GUI Agents

Apr 24, 2025
Via arXiv

The Obvious Invisible Threat: LLM-Powered GUI Agents' Vulnerability to Fine-Print Injections

Apr 15, 2025
Via arXiv

Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents

Feb 18, 2025
Via arXiv

GenSpectrum Chat: Data Exploration in Public Health Using Large Language Models

May 23, 2023
Via arXiv

Patterns for Representing Knowledge Graphs to Communicate Situational Knowledge of Service Robots

Jan 26, 2021
Via arXiv