Di Zhang

MinT: Managed Infrastructure for Training and Serving Millions of LLMs

May 13, 2026
Via arXiv

$δ$-mem: Efficient Online Memory for Large Language Models

May 12, 2026
Via arXiv

Steering Visual Generation in Unified Multimodal Models with Understanding Supervision

May 07, 2026
Via arXiv

PhysInOne: Visual Physics Learning and Reasoning in One Suite

Apr 10, 2026
Via arXiv

SlideSparse: Fast and Flexible (2N-2):2N Structured Sparsity

Mar 05, 2026
Via arXiv

Sparse-BitNet: 1.58-bit LLMs are Naturally Friendly to Semi-Structured Sparsity

Mar 05, 2026
Via arXiv

CARE: A Molecular-Guided Foundation Model with Adaptive Region Modeling for Whole Slide Image Analysis

Feb 25, 2026
Via arXiv

Learning Native Continuation for Action Chunking Flow Policies

Feb 13, 2026
Via arXiv

Learning Geometrically-Grounded 3D Visual Representations for View-Generalizable Robotic Manipulation

Jan 30, 2026
Via arXiv

Golden Goose: A Simple Trick to Synthesize Unlimited RLVR Tasks from Unverifiable Internet Text

Jan 30, 2026
Via arXiv