Picture for Arya Mazaheri

Arya Mazaheri

PipeInfer: Accelerating LLM Inference using Asynchronous Pipelined Speculation

Add code
Jul 16, 2024
Viaarxiv icon

GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning

Add code
Feb 05, 2021
Figure 1 for GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
Figure 2 for GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
Figure 3 for GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
Figure 4 for GNN-RL Compression: Topology-Aware Network Pruning using Multi-stage Graph Embedding and Reinforcement Learning
Viaarxiv icon

Auto Graph Encoder-Decoder for Model Compression and Network Acceleration

Add code
Dec 31, 2020
Figure 1 for Auto Graph Encoder-Decoder for Model Compression and Network Acceleration
Figure 2 for Auto Graph Encoder-Decoder for Model Compression and Network Acceleration
Figure 3 for Auto Graph Encoder-Decoder for Model Compression and Network Acceleration
Figure 4 for Auto Graph Encoder-Decoder for Model Compression and Network Acceleration
Viaarxiv icon