Picture for Bing Qin

Bing Qin

UFO-RL: Uncertainty-Focused Optimization for Efficient Reinforcement Learning Data Selection

Add code
May 18, 2025
Viaarxiv icon

Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems

Add code
May 18, 2025
Viaarxiv icon

Simulating Before Planning: Constructing Intrinsic User World Model for User-Tailored Dialogue Policy Planning

Add code
Apr 18, 2025
Viaarxiv icon

Information Gain-Guided Causal Intervention for Autonomous Debiasing Large Language Models

Add code
Apr 17, 2025
Viaarxiv icon

AdaSteer: Your Aligned LLM is Inherently an Adaptive Jailbreak Defender

Add code
Apr 13, 2025
Viaarxiv icon

Chain of Strategy Optimization Makes Large Language Models Better Emotional Supporter

Add code
Mar 07, 2025
Viaarxiv icon

Enhancing Non-English Capabilities of English-Centric Large Language Models through Deep Supervision Fine-Tuning

Add code
Mar 05, 2025
Viaarxiv icon

From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems

Add code
Mar 03, 2025
Viaarxiv icon

Beware of Your Po! Measuring and Mitigating AI Safety Risks in Role-Play Fine-Tuning of LLMs

Add code
Feb 28, 2025
Viaarxiv icon

MA-GTS: A Multi-Agent Framework for Solving Complex Graph Problems in Real-World Applications

Add code
Feb 25, 2025
Viaarxiv icon