Picture for Yudong Pan

Yudong Pan

TriMoE: Augmenting GPU with AMX-Enabled CPU and DIMM-NDP for High-Throughput MoE Inference via Offloading

Add code
Mar 01, 2026
Viaarxiv icon

From Buffers to Registers: Unlocking Fine-Grained FlashAttention with Hybrid-Bonded 3D NPU Co-Design

Add code
Feb 11, 2026
Viaarxiv icon

COMET: Towards Partical W4A4KV4 LLMs Serving

Add code
Oct 16, 2024
Figure 1 for COMET: Towards Partical W4A4KV4 LLMs Serving
Figure 2 for COMET: Towards Partical W4A4KV4 LLMs Serving
Figure 3 for COMET: Towards Partical W4A4KV4 LLMs Serving
Figure 4 for COMET: Towards Partical W4A4KV4 LLMs Serving
Viaarxiv icon

Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework

Add code
Mar 17, 2024
Figure 1 for Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework
Figure 2 for Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework
Figure 3 for Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework
Figure 4 for Data is all you need: Finetuning LLMs for Chip Design via an Automated design-data augmentation framework
Viaarxiv icon

Short-time SSVEP data extension by a novel generative adversarial networks based framework

Add code
Jan 18, 2023
Viaarxiv icon

A Transformer-based deep neural network model for SSVEP classification

Add code
Oct 09, 2022
Figure 1 for A Transformer-based deep neural network model for SSVEP classification
Figure 2 for A Transformer-based deep neural network model for SSVEP classification
Figure 3 for A Transformer-based deep neural network model for SSVEP classification
Figure 4 for A Transformer-based deep neural network model for SSVEP classification
Viaarxiv icon