Picture for Xianfeng Tang

Xianfeng Tang

MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs

Add code
Oct 29, 2025
Viaarxiv icon

TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use

Add code
Oct 06, 2025
Figure 1 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 2 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 3 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Figure 4 for TRAJECT-Bench:A Trajectory-Aware Benchmark for Evaluating Agentic Tool Use
Viaarxiv icon

Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?

Add code
Aug 05, 2025
Viaarxiv icon

Bradley-Terry and Multi-Objective Reward Modeling Are Complementary

Add code
Jul 10, 2025
Viaarxiv icon

RRO: LLM Agent Optimization Through Rising Reward Trajectories

Add code
May 27, 2025
Figure 1 for RRO: LLM Agent Optimization Through Rising Reward Trajectories
Figure 2 for RRO: LLM Agent Optimization Through Rising Reward Trajectories
Figure 3 for RRO: LLM Agent Optimization Through Rising Reward Trajectories
Figure 4 for RRO: LLM Agent Optimization Through Rising Reward Trajectories
Viaarxiv icon

Efficient Long CoT Reasoning in Small Language Models

Add code
May 24, 2025
Viaarxiv icon

Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation

Add code
May 20, 2025
Viaarxiv icon

Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

Add code
Apr 11, 2025
Figure 1 for Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Figure 2 for Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Figure 3 for Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Figure 4 for Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models
Viaarxiv icon

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Add code
Apr 10, 2025
Figure 1 for SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Figure 2 for SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Figure 3 for SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Figure 4 for SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
Viaarxiv icon

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Add code
Apr 01, 2025
Viaarxiv icon