Picture for Cheng Qian

Cheng Qian

May

Atomic Reasoning for Scientific Table Claim Verification

Add code
Jun 08, 2025
Viaarxiv icon

DecisionFlow: Advancing Large Language Model as Principled Decision Maker

Add code
May 27, 2025
Viaarxiv icon

ModelingAgent: Bridging LLMs and Mathematical Modeling for Real-World Challenges

Add code
May 21, 2025
Viaarxiv icon

RM-R1: Reward Modeling as Reasoning

Add code
May 05, 2025
Viaarxiv icon

OTC: Optimal Tool Calls via Reinforcement Learning

Add code
Apr 21, 2025
Viaarxiv icon

ToolRL: Reward is All Tool Learning Needs

Add code
Apr 16, 2025
Viaarxiv icon

Alice: Proactive Learning with Teacher's Demonstrations for Weak-to-Strong Generalization

Add code
Apr 09, 2025
Viaarxiv icon

AIR: A Systematic Analysis of Annotations, Instructions, and Response Pairs in Preference Dataset

Add code
Apr 04, 2025
Viaarxiv icon

MultiAgentBench: Evaluating the Collaboration and Competition of LLM agents

Add code
Mar 03, 2025
Viaarxiv icon

The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination

Add code
Feb 22, 2025
Figure 1 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
Figure 2 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
Figure 3 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
Figure 4 for The Law of Knowledge Overshadowing: Towards Understanding, Predicting, and Preventing LLM Hallucination
Viaarxiv icon