Massoud Pedram

SkipKV: Selective Skipping of KV Generation and Storage for Efficient Inference with Large Reasoning Models

Dec 08, 2025
Via arXiv

ASAP-FE: Energy-Efficient Feature Extraction Enabling Multi-Channel Keyword Spotting on Edge Processors

Jun 17, 2025
Via arXiv

MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering

Jun 16, 2025
Via arXiv

FAIR-SIGHT: Fairness Assurance in Image Recognition via Simultaneous Conformal Thresholding and Dynamic Output Repair

Apr 10, 2025
Via arXiv

RocketPPA: Ultra-Fast LLM-Based PPA Estimator at Code-Level Abstraction

Mar 27, 2025
Via arXiv

FACTER: Fairness-Aware Conformal Thresholding and Prompt Engineering for Enabling Fair LLM-Based Recommender Systems

Feb 05, 2025
Via arXiv

Efficient Noise Mitigation for Enhancing Inference Accuracy in DNNs on Mixed-Signal Accelerators

Sep 27, 2024
Via arXiv

Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation

Jul 19, 2024
Via arXiv

CHOSEN: Compilation to Hardware Optimization Stack for Efficient Vision Transformer Inference

Jul 17, 2024
Via arXiv

ARCO: Adaptive Multi-Agent Reinforcement Learning-Based Hardware/Software Co-Optimization Compiler for Improved Performance in DNN Accelerator Design

Jul 11, 2024
Via arXiv