Tao Yang

DAMO Academy, Alibaba Group

Discriminative Policy Optimization for Token-Level Reward Models

May 29, 2025
Via arXiv

Structured Memory Mechanisms for Stable Context Representation in Large Language Models

May 28, 2025
Via arXiv

DASH: Input-Aware Dynamic Layer Skipping for Efficient LLM Inference with Markov Decision Policies

May 23, 2025
Via arXiv

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

May 21, 2025
Via arXiv

Tianyi: A Traditional Chinese Medicine all-rounder language model and its Real-World Clinical Practice

May 19, 2025
Via arXiv

MPS-Prover: Advancing Stepwise Theorem Proving by Multi-Perspective Search and Data Curation

May 16, 2025
Via arXiv

One-Point Sampling for Distributed Bandit Convex Optimization with Time-Varying Constraints

Apr 24, 2025
Via arXiv

Dynamic Superblock Pruning for Fast Learned Sparse Retrieval

Apr 23, 2025
Via arXiv

A Deep Learning Framework for Sequence Mining with Bidirectional LSTM and Multi-Scale Attention

Apr 21, 2025
Via arXiv

Stabilization Analysis and Mode Recognition of Kerosene Supersonic Combustion: A Deep Learning Approach Based on Res-CNN-beta-VAE

Mar 17, 2025
Via arXiv