Picture for Haoxuan Shan

Haoxuan Shan

Helen

EVA: Accelerating LLM Decoding via an Efficient Vector Quantization Architecture

Add code
May 22, 2026
Viaarxiv icon

A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models

Add code
Oct 08, 2024
Figure 1 for A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models
Figure 2 for A Survey: Collaborative Hardware and Software Design in the Era of Large Language Models
Viaarxiv icon