Picture for Erpeng Xue

Erpeng Xue

SKILLC: Learning Autonomous Skill Internalization in LLM Agents via Contrastive Credit Assignment

Add code
May 27, 2026
Viaarxiv icon

When the Majority Votes Wrong, the Intervention Timing for Test-Time Reinforcement Learning Hides in the Extinction Window

Add code
May 19, 2026
Viaarxiv icon

Action is All You Need: Dual-Flow Generative Ranking Network for Recommendation

Add code
May 22, 2025
Viaarxiv icon

End-to-end training of Multimodal Model and ranking Model

Add code
Apr 09, 2024
Figure 1 for End-to-end training of Multimodal Model and ranking Model
Figure 2 for End-to-end training of Multimodal Model and ranking Model
Figure 3 for End-to-end training of Multimodal Model and ranking Model
Figure 4 for End-to-end training of Multimodal Model and ranking Model
Viaarxiv icon