Yanzhi Wang

RCR-Router: Efficient Role-Aware Context Routing for Multi-Agent LLM Systems with Structured Memory

Aug 06, 2025
Via arXiv

Taming Diffusion Transformer for Real-Time Mobile Video Generation

Jul 17, 2025
Via arXiv

Enabling Flexible Multi-LLM Integration for Scalable Knowledge Aggregation

May 28, 2025
Via arXiv

ALTER: All-in-One Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation

May 27, 2025
Via arXiv

Taming Diffusion for Dataset Distillation with High Representativeness

May 23, 2025
Via arXiv

Structured Agent Distillation for Large Language Model

May 20, 2025
Via arXiv

Perturbation-efficient Zeroth-order Optimization for Hardware-friendly On-device Training

Apr 28, 2025
Via arXiv

H3AE: High Compression, High Speed, and High Quality AutoEncoder for Video Diffusion Models

Apr 14, 2025
Via arXiv

QuartDepth: Post-Training Quantization for Real-Time Depth Estimation on the Edge

Mar 20, 2025
Via arXiv

HDCompression: Hybrid-Diffusion Image Compression for Ultra-Low Bitrates

Feb 11, 2025
Via arXiv