Fei Huang

additional authors not shown

CPO: Addressing Reward Ambiguity in Role-playing Dialogue via Comparative Policy Optimization

Aug 12, 2025
Via arXiv

Memp: Exploring Agent Procedural Memory

Aug 08, 2025
Via arXiv

RL-PLUS: Countering Capability Boundary Collapse of LLMs in Reinforcement Learning with Hybrid-policy Optimization

Jul 31, 2025
Via arXiv

RMTBench: Benchmarking LLMs Through Multi-Turn User-Centric Role-Playing

Jul 27, 2025
Via arXiv

Translationese-index: Using Likelihood Ratios for Graded and Generalizable Measurement of Translationese

Jul 16, 2025
Via arXiv

Perception-Aware Policy Optimization for Multimodal Reasoning

Jul 08, 2025
Via arXiv

WebSailor: Navigating Super-human Reasoning for Web Agent

Jul 03, 2025
Via arXiv

DynamicBench: Evaluating Real-Time Report Generation in Large Language Models

Jun 26, 2025
Via arXiv

EIFBENCH: Extremely Complex Instruction Following Benchmark for Large Language Models

Jun 10, 2025
Via arXiv

Writing-RL: Advancing Long-form Writing via Adaptive Curriculum Reinforcement Learning

Jun 06, 2025
Via arXiv