Picture for Caiming Xiong

Caiming Xiong

Salesforce AI Research

MCP-Universe: Benchmarking Large Language Models with Real-World Model Context Protocol Servers

Add code
Aug 20, 2025
Viaarxiv icon

CoAct-1: Computer-using Agents with Coding as Actions

Add code
Aug 05, 2025
Viaarxiv icon

UserBench: An Interactive Gym Environment for User-Centric Agents

Add code
Jul 29, 2025
Viaarxiv icon

Beyond Accuracy: Dissecting Mathematical Reasoning for LLMs Under Reinforcement Learning

Add code
Jun 05, 2025
Viaarxiv icon

MAS-ZERO: Designing Multi-Agent Systems with Zero Supervision

Add code
May 26, 2025
Viaarxiv icon

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

Add code
May 24, 2025
Viaarxiv icon

Meta-Design Matters: A Self-Design Multi-Agent System

Add code
May 21, 2025
Viaarxiv icon

Fractured Chain-of-Thought Reasoning

Add code
May 19, 2025
Viaarxiv icon

Scaling Computer-Use Grounding via User Interface Decomposition and Synthesis

Add code
May 19, 2025
Viaarxiv icon

J4R: Learning to Judge with Equivalent Initial State Group Relative Preference Optimization

Add code
May 19, 2025
Viaarxiv icon