Chien-Sheng Wu

GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness

Oct 01, 2025
Via arXiv

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

May 24, 2025
Via arXiv

AI-Slop to AI-Polish? Aligning Language Models through Edit-Based Writing Rewards and Test-time Computation

Apr 10, 2025
Via arXiv

BingoGuard: LLM Content Moderation Tools with Risk Levels

Mar 09, 2025
Via arXiv

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

Feb 24, 2025
Via arXiv

Why Vision Language Models Struggle with Visual Arithmetic? Towards Enhanced Chart and Geometry Understanding

Feb 17, 2025
Via arXiv

Proactive Conversational Agents with Inner Thoughts

Dec 31, 2024
Via arXiv

SummExecEdit: A Factual Consistency Benchmark in Summarization with Executable Edits

Dec 17, 2024
Via arXiv

Unanswerability Evaluation for Retrieval Augmented Generation

Dec 16, 2024
Via arXiv

SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Dec 09, 2024
Via arXiv