Marian Verhelst

P3-LLM: An Integrated NPU-PIM Accelerator for LLM Inference Using Hybrid Numerical Formats

Nov 16, 2025

Precision-Scalable Microscaling Datapaths with Optimized Reduction Tree for Efficient NPU Integration

Nov 09, 2025

An Open-Source HW-SW Co-Development Framework Enabling Efficient Multi-Accelerator Systems

Aug 20, 2025

Efficient Precision-Scalable Hardware for Microscaling (MX) Processing in Robotics Learning

May 28, 2025

CNN-based Robust Sound Source Localization with SRP-PHAT for the Extreme Edge

Mar 03, 2025

Anda: Unlocking Efficient LLM Inference with a Variable-Length Grouped Activation Data Format

Nov 24, 2024

OpenGeMM: A High-Utilization GeMM Accelerator Generator with Lightweight RISC-V Control and Tight Memory Coupling

Nov 14, 2024

MATCH: Model-Aware TVM-based Compilation for Heterogeneous Edge Devices

Oct 11, 2024

Pack my weights and run! Minimizing overheads for in-memory computing accelerators

Sep 15, 2024

ACCO: Automated Causal CNN Scheduling Optimizer for Real-Time Edge Accelerators

Jun 11, 2024