Picture for Yang Bai

Yang Bai

Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation

Add code
May 13, 2026
Viaarxiv icon

Multi-Objective and Mixed-Reward Reinforcement Learning via Reward-Decorrelated Policy Optimization

Add code
May 13, 2026
Viaarxiv icon

Proximal Path-Specific Inference

Add code
May 10, 2026
Viaarxiv icon

VideoWeaver: Multimodal Multi-View Video-to-Video Transfer for Embodied Agents

Add code
Mar 26, 2026
Viaarxiv icon

Aligning Paralinguistic Understanding and Generation in Speech LLMs via Multi-Task Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

ConfCtrl: Enabling Precise Camera Control in Video Diffusion via Confidence-Aware Interpolation

Add code
Mar 10, 2026
Viaarxiv icon

RynnBrain: Open Embodied Foundation Models

Add code
Feb 13, 2026
Viaarxiv icon

A Machine Learning accelerated geophysical fluid solver

Add code
Feb 09, 2026
Viaarxiv icon

Note2Chat: Improving LLMs for Multi-Turn Clinical History Taking Using Medical Notes

Add code
Jan 29, 2026
Viaarxiv icon

LongCat-Flash-Thinking-2601 Technical Report

Add code
Jan 23, 2026
Viaarxiv icon