Picture for Yuxuan Lu

Yuxuan Lu

Humans' ALMANAC: A Human Collaboration Dataset of Action-Level Mental Model Annotations for Agent Collaboration

Add code
Jun 04, 2026
Viaarxiv icon

CollabSim: A CSCW-Grounded Methodology for Investigating Collaborative Competence of LLM Agents through Controlled Multi-Agent Experiments

Add code
Jun 04, 2026
Viaarxiv icon

Jailbreaking LLMs via Calibration

Add code
Jan 31, 2026
Viaarxiv icon

Trajectory2Task: Training Robust Tool-Calling Agents with Synthesized Yet Verifiable Data for Complex User Intents

Add code
Jan 28, 2026
Viaarxiv icon

See, Think, Act: Online Shopper Behavior Simulation with VLM Agents

Add code
Oct 22, 2025
Viaarxiv icon

DPRF: A Generalizable Dynamic Persona Refinement Framework for Optimizing Behavior Alignment Between Personalized LLM Role-Playing Agents and Humans

Add code
Oct 16, 2025
Viaarxiv icon

Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping

Add code
Oct 08, 2025
Figure 1 for Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
Figure 2 for Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
Figure 3 for Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
Figure 4 for Customer-R1: Personalized Simulation of Human Behaviors via RL-based LLM Agent in Online Shopping
Viaarxiv icon

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation

Add code
Jul 28, 2025
Figure 1 for Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Figure 2 for Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Figure 3 for Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Figure 4 for Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation
Viaarxiv icon

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Add code
Jul 23, 2025
Figure 1 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 2 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 3 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 4 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Viaarxiv icon

Aligned Textual Scoring Rules

Add code
Jul 08, 2025
Viaarxiv icon