Mao Yang

Microsoft Research

SeerAttention-R: Sparse Attention Adaptation for Long Reasoning

Jun 10, 2025
Via arXiv

rStar-Coder: Scaling Competitive Code Reasoning with a Large-Scale Verified Dataset

May 27, 2025
Via arXiv

SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale

May 26, 2025
Via arXiv

RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference

May 05, 2025
Via arXiv

BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache

Mar 24, 2025
Via arXiv

LongRoPE2: Near-Lossless LLM Context Window Scaling

Feb 27, 2025
Via arXiv

AttentionEngine: A Versatile Framework for Efficient Attention Mechanisms on Diverse Hardware Platforms

Feb 21, 2025
Via arXiv

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Jan 23, 2025
Via arXiv

LUT-DLA: Lookup Table as Efficient Extreme Low-Bit Deep Learning Accelerator

Jan 18, 2025
Via arXiv

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Jan 08, 2025
Via arXiv