Picture for Dakuo Wang

Dakuo Wang

Northeastern University, USA

SurgWound-Bench: A Benchmark for Surgical Wound Diagnosis

Add code
Aug 21, 2025
Viaarxiv icon

Multi-Agent-as-Judge: Aligning LLM-Agent-Based Automated Evaluation with Multi-Dimensional Human Evaluation

Add code
Jul 28, 2025
Viaarxiv icon

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Add code
Jul 23, 2025
Viaarxiv icon

OPeRA: A Dataset of Observation, Persona, Rationale, and Action for Evaluating LLMs on Human Online Shopping Behavior Simulation

Add code
Jun 05, 2025
Viaarxiv icon

AgentA/B: Automated and Scalable Web A/BTesting with Interactive LLM Agents

Add code
Apr 13, 2025
Viaarxiv icon

UXAgent: A System for Simulating Usability Testing of Web Design with LLM Agents

Add code
Apr 13, 2025
Viaarxiv icon

Beyond Believability: Accurate Human Behavior Simulation with Fine-Tuned LLMs

Add code
Mar 27, 2025
Viaarxiv icon

Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents

Add code
Feb 18, 2025
Figure 1 for Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
Figure 2 for Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
Figure 3 for Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
Figure 4 for Towards a Design Guideline for RPA Evaluation: A Survey of Large Language Model-Based Role-Playing Agents
Viaarxiv icon

UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design

Add code
Feb 18, 2025
Figure 1 for UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
Figure 2 for UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
Figure 3 for UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
Figure 4 for UXAgent: An LLM Agent-Based Usability Testing Framework for Web Design
Viaarxiv icon

Toward Metaphor-Fluid Conversation Design for Voice User Interfaces

Add code
Feb 17, 2025
Viaarxiv icon