Tao Yang

DAMO Academy, Alibaba Group

AgentMath: Empowering Mathematical Reasoning for Large Language Models via Tool-Augmented Agent

Dec 23, 2025
Via arXiv

A Specialized Large Language Model for Clinical Reasoning and Diagnosis in Rare Diseases

Nov 18, 2025
Via arXiv

MarsRL: Advancing Multi-Agent Reasoning System via Reinforcement Learning with Agentic Pipeline Parallelism

Nov 14, 2025
Via arXiv

QUARK: Quantization-Enabled Circuit Sharing for Transformer Acceleration by Exploiting Common Patterns in Nonlinear Operations

Nov 10, 2025
Via arXiv

Self-Forcing++: Towards Minute-Scale High-Quality Video Generation

Oct 02, 2025
Via arXiv

Learning More with Less: A Dynamic Dual-Level Down-Sampling Framework for Efficient Policy Optimization

Sep 26, 2025
Via arXiv

WeChat-YATT: A Simple, Scalable and Balanced RLHF Trainer

Aug 11, 2025
Via arXiv

G-Core: A Simple, Scalable and Balanced RLHF Trainer

Jul 30, 2025
Via arXiv

Seismic Acoustic Impedance Inversion Framework Based on Conditional Latent Generative Diffusion Model

Jun 16, 2025
Via arXiv

crossMoDA Challenge: Evolution of Cross-Modality Domain Adaptation Techniques for Vestibular Schwannoma and Cochlea Segmentation from 2021 to 2023

Jun 13, 2025
Via arXiv