Picture for Yue Cheng

Yue Cheng

ZeRO-Prefill: Zero Redundancy Overheads in MoE Prefill Serving

Add code
May 03, 2026
Viaarxiv icon

M3D-Net: Multi-Modal 3D Facial Feature Reconstruction Network for Deepfake Detection

Add code
Apr 16, 2026
Viaarxiv icon

Reducing Hallucination in Enterprise AI Workflows via Hybrid Utility Minimum Bayes Risk (HUMBR)

Add code
Apr 13, 2026
Viaarxiv icon

ViviDoc: Generating Interactive Documents through Human-Agent Collaboration

Add code
Mar 30, 2026
Viaarxiv icon

ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates

Add code
May 18, 2025
Viaarxiv icon

LCGC: Learning from Consistency Gradient Conflicting for Class-Imbalanced Semi-Supervised Debiasing

Add code
Apr 09, 2025
Viaarxiv icon

Ensuring Fair LLM Serving Amid Diverse Applications

Add code
Nov 24, 2024
Figure 1 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 2 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 3 for Ensuring Fair LLM Serving Amid Diverse Applications
Figure 4 for Ensuring Fair LLM Serving Amid Diverse Applications
Viaarxiv icon

LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data

Add code
Oct 28, 2024
Figure 1 for LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
Figure 2 for LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
Figure 3 for LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
Figure 4 for LOCAL: Learning with Orientation Matrix to Infer Causal Structure from Time Series Data
Viaarxiv icon

LinBridge: A Learnable Framework for Interpreting Nonlinear Neural Encoding Models

Add code
Oct 26, 2024
Figure 1 for LinBridge: A Learnable Framework for Interpreting Nonlinear Neural Encoding Models
Figure 2 for LinBridge: A Learnable Framework for Interpreting Nonlinear Neural Encoding Models
Figure 3 for LinBridge: A Learnable Framework for Interpreting Nonlinear Neural Encoding Models
Figure 4 for LinBridge: A Learnable Framework for Interpreting Nonlinear Neural Encoding Models
Viaarxiv icon

Everything You Always Wanted to Know About Storage Compressibility of Pre-Trained ML Models but Were Afraid to Ask

Add code
Feb 20, 2024
Viaarxiv icon