Picture for Guohao Dai

Guohao Dai

Infinigence-AI, Shanghai Jiao Tong University

TASP: Topology-aware Sequence Parallelism

Add code
Sep 30, 2025
Viaarxiv icon

RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation

Add code
Sep 19, 2025
Figure 1 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 2 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 3 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Figure 4 for RLinf: Flexible and Efficient Large-scale Reinforcement Learning via Macro-to-Micro Flow Transformation
Viaarxiv icon

SpecDiff: Accelerating Diffusion Model Inference with Self-Speculation

Add code
Sep 17, 2025
Viaarxiv icon

VocabTailor: Dynamic Vocabulary Selection for Downstream Tasks in Small Language Models

Add code
Aug 21, 2025
Viaarxiv icon

Megrez2 Technical Report

Add code
Jul 23, 2025
Viaarxiv icon

R2R: Efficiently Navigating Divergent Reasoning Paths with Small-Large Model Token Routing

Add code
May 27, 2025
Viaarxiv icon

PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs

Add code
May 24, 2025
Viaarxiv icon

semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage

Add code
Apr 28, 2025
Viaarxiv icon

FlashOverlap: A Lightweight Design for Efficiently Overlapping Communication and Computation

Add code
Apr 28, 2025
Viaarxiv icon

VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Add code
Apr 16, 2025
Figure 1 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 2 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 3 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 4 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Viaarxiv icon