Jinyang Wu

OPID: On-Policy Skill Distillation for Agentic Reinforcement Learning

Jun 25, 2026, via arXiv

Supervised Post-training of Speech Foundation Models for Robust Adaptation in Speech Deepfake Detection

Jun 24, 2026, via arXiv

Orchestra-o1: Omnimodal Agent Orchestration

Jun 10, 2026, via arXiv

Late-Layer Fusion is Enough: Dual-Path Vision Token Routing for Multimodal Large Language Models under Visual Saturation

Jun 08, 2026, via arXiv

Learning to Adapt SFT Data for Better Reasoning Generalization

May 26, 2026, via arXiv

Maestro: Reinforcement Learning to Orchestrate Hierarchical Model-Skill Ensembles

May 21, 2026, via arXiv

Implicit Hierarchical GRPO: Decoupling Tool Invocation from Execution for Tool-Integrated Mathematical Reasoning

May 18, 2026, via arXiv

Self-Distilled Agentic Reinforcement Learning

May 14, 2026, via arXiv

RobotEQ: Transitioning from Passive Intelligence to Active Intelligence in Embodied AI

May 07, 2026, via arXiv

SKILL0: In-Context Agentic Reinforcement Learning for Skill Internalization

Apr 02, 2026, via arXiv