Chien-Sheng Wu

From Passive Metric to Active Signal: The Evolving Role of Uncertainty Quantification in Large Language Models

Jan 22, 2026

Agentic Confidence Calibration

Jan 22, 2026

Agentic Uncertainty Quantification

Jan 22, 2026

The Need for a Socially-Grounded Persona Framework for User Simulation

Jan 12, 2026

MMPersuade: A Dataset and Evaluation Framework for Multimodal Persuasion

Oct 26, 2025

GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness

Oct 01, 2025

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

May 24, 2025

AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation

Apr 10, 2025

BingoGuard: LLM Content Moderation Tools with Risk Levels

Mar 09, 2025

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

Feb 24, 2025