Picture for Xingshan Zeng

Xingshan Zeng

Pitfalls of Rule- and Model-based Verifiers -- A Case Study on Mathematical Reasoning

Add code
May 28, 2025
Viaarxiv icon

Stepwise Reasoning Checkpoint Analysis: A Test Time Scaling Method to Enhance LLMs' Reasoning

Add code
May 23, 2025
Viaarxiv icon

The Real Barrier to LLM Agent Usability is Agentic ROI

Add code
May 23, 2025
Viaarxiv icon

ToolACE-DEV: Self-Improving Tool Learning via Decomposition and EVolution

Add code
May 12, 2025
Viaarxiv icon

Advancing and Benchmarking Personalized Tool Invocation for LLMs

Add code
May 07, 2025
Viaarxiv icon

ToolACE-R: Tool Learning with Adaptive Self-Refinement

Add code
Apr 02, 2025
Viaarxiv icon

GUI Agents with Foundation Models: A Comprehensive Survey

Add code
Nov 07, 2024
Figure 1 for GUI Agents with Foundation Models: A Comprehensive Survey
Figure 2 for GUI Agents with Foundation Models: A Comprehensive Survey
Viaarxiv icon

ToolFlow: Boosting LLM Tool-Calling Through Natural and Coherent Dialogue Synthesis

Add code
Oct 24, 2024
Viaarxiv icon

ToolACE: Winning the Points of LLM Function Calling

Add code
Sep 02, 2024
Figure 1 for ToolACE: Winning the Points of LLM Function Calling
Figure 2 for ToolACE: Winning the Points of LLM Function Calling
Figure 3 for ToolACE: Winning the Points of LLM Function Calling
Figure 4 for ToolACE: Winning the Points of LLM Function Calling
Viaarxiv icon

Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization

Add code
Aug 14, 2024
Viaarxiv icon