Zhicheng Hu

Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference

Feb 21, 2025
Figures 1 to 4 for Round Attention: A Novel Round-Level Attention Mechanism to Accelerate LLM Inference
Via arXiv