Yang Bai

Explore Data Left Behind in Reinforcement Learning for Reasoning Language Models

Nov 06, 2025
Via arXiv

A Survey on LLM Mid-training

Oct 27, 2025
Via arXiv

On Multiple Robustness of Proximal Dynamic Treatment Regimes

Oct 23, 2025
Via arXiv

Making Mathematical Reasoning Adaptive

Oct 06, 2025
Via arXiv

Fre-CW: Targeted Attack on Time Series Forecasting using Frequency Domain Loss

Aug 12, 2025
Via arXiv

Improving Generative Ad Text on Facebook using Reinforcement Learning

Jul 29, 2025
Via arXiv

RoboSwap: A GAN-driven Video Diffusion Framework For Unsupervised Robot Arm Swapping

Jun 10, 2025
Via arXiv

Multi-task Learning for Heterogeneous Data via Integrating Shared and Task-Specific Encodings

May 30, 2025
Via arXiv

Multi-task Learning for Heterogeneous Multi-source Block-Wise Missing Data

May 30, 2025
Via arXiv

An integrated language-vision foundation model for conversational diagnostics and triaging in primary eye care

May 13, 2025
Via arXiv