Picture for Chenxu Yang

Chenxu Yang

Near-Future Policy Optimization

Add code
Apr 22, 2026
Viaarxiv icon

Self-Distilled RLVR

Add code
Apr 03, 2026
Viaarxiv icon

Exposing Cross-Modal Consistency for Fake News Detection in Short-Form Videos

Add code
Mar 16, 2026
Viaarxiv icon

Beyond the Covariance Trap: Unlocking Generalization in Same-Subject Knowledge Editing for Large Language Models

Add code
Mar 16, 2026
Viaarxiv icon

System 1&2 Synergy via Dynamic Model Interpolation

Add code
Jan 29, 2026
Viaarxiv icon

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models

Add code
Aug 26, 2025
Viaarxiv icon

Weights-Rotated Preference Optimization for Large Language Models

Add code
Aug 25, 2025
Viaarxiv icon

S-GRPO: Early Exit via Reinforcement Learning in Reasoning Models

Add code
May 12, 2025
Viaarxiv icon

Dynamic Early Exit in Reasoning Models

Add code
Apr 22, 2025
Viaarxiv icon

A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles

Add code
Nov 04, 2024
Figure 1 for A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
Figure 2 for A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
Figure 3 for A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
Figure 4 for A Multi-Task Role-Playing Agent Capable of Imitating Character Linguistic Styles
Viaarxiv icon