Hongwu Peng

Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate

Feb 05, 2024
Can Jin, Tong Che, Hongwu Peng, Yiyuan Li, Marco Pavone

Zero-Space Cost Fault Tolerance for Transformer-based Language Models on ReRAM

Jan 22, 2024
Bingbing Li, Geng Yuan, Zigeng Wang, Shaoyi Huang, Hongwu Peng, Payman Behnam, Wujie Wen, Hang Liu, Caiwen Ding

Medusa: Simple LLM Inference Acceleration Framework with Multiple Decoding Heads

Jan 19, 2024
Tianle Cai, Yuhong Li, Zhengyang Geng, Hongwu Peng, Jason D. Lee, Deming Chen, Tri Dao

MaxK-GNN: Towards Theoretical Speed Limits for Accelerating Graph Neural Networks Training

Dec 18, 2023
Hongwu Peng, Xi Xie, Kaustubh Shivdikar, MD Amit Hasan, Jiahui Zhao, Shaoyi Huang, Omer Khan, David Kaeli, Caiwen Ding

Advanced Language Model-Driven Verilog Development: Enhancing Power, Performance, and Area Optimization in Code Synthesis

Dec 02, 2023
Kiran Thorat, Jiahui Zhao, Yaotian Liu, Hongwu Peng, Xi Xie, Bin Lei, Jeff Zhang, Caiwen Ding

Evaluating Emerging AI/ML Accelerators: IPU, RDU, and NVIDIA/AMD GPUs

Nov 08, 2023
Hongwu Peng, Caiwen Ding, Tong Geng, Sutanay Choudhury, Kevin Barker, Ang Li

LinGCN: Structural Linearized Graph Convolutional Network for Homomorphically Encrypted Inference

Sep 30, 2023
Hongwu Peng, Ran Ran, Yukui Luo, Jiahui Zhao, Shaoyi Huang, Kiran Thorat, Tong Geng, Chenghong Wang, Xiaolin Xu, Wujie Wen, Caiwen Ding

Accel-GCN: High-Performance GPU Accelerator Design for Graph Convolution Networks

Aug 22, 2023
Xi Xie, Hongwu Peng, Amit Hasan, Shaoyi Huang, Jiahui Zhao, Haowen Fang, Wei Zhang, Tong Geng, Omer Khan, Caiwen Ding

AutoReP: Automatic ReLU Replacement for Fast Private Network Inference

Aug 20, 2023
Hongwu Peng, Shaoyi Huang, Tong Zhou, Yukui Luo, Chenghong Wang, Zigeng Wang, Jiahui Zhao, Xi Xie, Ang Li, Tony Geng, Kaleel Mahmood, Wujie Wen, Xiaolin Xu, Caiwen Ding

RRNet: Towards ReLU-Reduced Neural Network for Two-party Computation Based Private Inference

Feb 22, 2023
Hongwu Peng, Shanglin Zhou, Yukui Luo, Nuo Xu, Shijin Duan, Ran Ran, Jiahui Zhao, Shaoyi Huang, Xi Xie, Chenghong Wang, Tong Geng, Wujie Wen, Xiaolin Xu, Caiwen Ding
