Picture for Minyi Guo

Minyi Guo

HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference

Add code
Nov 03, 2024
Figure 1 for HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference
Figure 2 for HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference
Figure 3 for HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference
Figure 4 for HOBBIT: A Mixed Precision Expert Offloading System for Fast MoE Inference
Viaarxiv icon

SparseTem: Boosting the Efficiency of CNN-Based Video Encoders by Exploiting Temporal Continuity

Add code
Oct 28, 2024
Viaarxiv icon

vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving

Add code
Jul 22, 2024
Figure 1 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 2 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 3 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Figure 4 for vTensor: Flexible Virtual Tensor Management for Efficient LLM Serving
Viaarxiv icon

AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs

Add code
Jul 21, 2024
Figure 1 for AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs
Figure 2 for AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs
Figure 3 for AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs
Figure 4 for AutoVCoder: A Systematic Framework for Automated Verilog Code Generation using LLMs
Viaarxiv icon

SimGen: Simulator-conditioned Driving Scene Generation

Add code
Jun 13, 2024
Figure 1 for SimGen: Simulator-conditioned Driving Scene Generation
Figure 2 for SimGen: Simulator-conditioned Driving Scene Generation
Figure 3 for SimGen: Simulator-conditioned Driving Scene Generation
Figure 4 for SimGen: Simulator-conditioned Driving Scene Generation
Viaarxiv icon

A Codesign of Scheduling and Parallelization for Large Model Training in Heterogeneous Clusters

Add code
Mar 24, 2024
Viaarxiv icon

Embodied Understanding of Driving Scenarios

Add code
Mar 07, 2024
Figure 1 for Embodied Understanding of Driving Scenarios
Figure 2 for Embodied Understanding of Driving Scenarios
Figure 3 for Embodied Understanding of Driving Scenarios
Figure 4 for Embodied Understanding of Driving Scenarios
Viaarxiv icon

Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension

Add code
Mar 06, 2024
Figure 1 for Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Figure 2 for Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Figure 3 for Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Figure 4 for Extend Your Own Correspondences: Unsupervised Distant Point Cloud Registration by Progressive Distance Extension
Viaarxiv icon

Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs

Add code
Jan 14, 2024
Figure 1 for Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs
Figure 2 for Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs
Figure 3 for Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs
Figure 4 for Hierarchical Source-to-Post-Route QoR Prediction in High-Level Synthesis with GNNs
Viaarxiv icon

STAG: Enabling Low Latency and Low Staleness of GNN-based Services with Dynamic Graphs

Add code
Sep 27, 2023
Viaarxiv icon