Zhiru Zhang

ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models

Jun 24, 2024
Via arXiv

Differentiable Combinatorial Scheduling at Scale

Jun 06, 2024
Via arXiv

Learning from Students: Applying t-Distributions to Explore Accurate and Efficient Formats for LLMs

May 06, 2024
Via arXiv

Radial Networks: Dynamic Layer Routing for High-Performance Large Language Models

Apr 07, 2024
Via arXiv

Allo: A Programming Model for Composable Accelerator Design

Apr 07, 2024
Via arXiv

UniSparse: An Intermediate Language for General Sparse Format Customization

Mar 09, 2024
Via arXiv

Less is More: Hop-Wise Graph Attention for Scalable and Generalizable Learning on Circuits

Mar 06, 2024
Via arXiv

Polynormer: Polynomial-Expressive Graph Transformer in Linear Time

Mar 02, 2024
Via arXiv

SAGMAN: Stability Analysis of Graph Neural Networks on the Manifolds

Feb 21, 2024
Via arXiv

Exploring the Limits of Semantic Image Compression at Micro-bits per Pixel

Feb 21, 2024
Via arXiv