Picture for Xianfeng Tang

Xianfeng Tang

RRO: LLM Agent Optimization Through Rising Reward Trajectories

Add code
May 27, 2025
Viaarxiv icon

Efficient Long CoT Reasoning in Small Language Models

Add code
May 24, 2025
Viaarxiv icon

Beyond Text: Unveiling Privacy Vulnerabilities in Multi-modal Retrieval-Augmented Generation

Add code
May 20, 2025
Viaarxiv icon

Harnessing the Unseen: The Hidden Influence of Intrinsic Knowledge in Long-Context Language Models

Add code
Apr 11, 2025
Viaarxiv icon

SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models

Add code
Apr 10, 2025
Viaarxiv icon

m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning with Large Language Models

Add code
Apr 01, 2025
Viaarxiv icon

ViLBench: A Suite for Vision-Language Process Reward Modeling

Add code
Mar 26, 2025
Viaarxiv icon

ResMoE: Space-efficient Compression of Mixture of Experts LLMs via Residual Restoration

Add code
Mar 10, 2025
Viaarxiv icon

Cite Before You Speak: Enhancing Context-Response Grounding in E-commerce Conversational LLM-Agents

Add code
Mar 05, 2025
Viaarxiv icon

How Far are LLMs from Real Search? A Comprehensive Study on Efficiency, Completeness, and Inherent Capabilities

Add code
Feb 26, 2025
Viaarxiv icon