Picture for Sreenivas Subramoney

Sreenivas Subramoney

QCQA: Quality and Capacity-aware grouped Query Attention

Add code
Jun 08, 2024
Figure 1 for QCQA: Quality and Capacity-aware grouped Query Attention
Figure 2 for QCQA: Quality and Capacity-aware grouped Query Attention
Figure 3 for QCQA: Quality and Capacity-aware grouped Query Attention
Figure 4 for QCQA: Quality and Capacity-aware grouped Query Attention
Viaarxiv icon

Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware

Add code
Feb 19, 2024
Figure 1 for Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Figure 2 for Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Figure 3 for Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Figure 4 for Towards Joint Optimization for DNN Architecture and Configuration for Compute-In-Memory Hardware
Viaarxiv icon

Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices

Add code
Apr 17, 2023
Figure 1 for Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Figure 2 for Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Figure 3 for Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Figure 4 for Reclaimer: A Reinforcement Learning Approach to Dynamic Resource Allocation for Cloud Microservices
Viaarxiv icon

VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs

Add code
Feb 23, 2023
Figure 1 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 2 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 3 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Figure 4 for VEGETA: Vertically-Integrated Extensions for Sparse/Dense GEMM Tile Acceleration on CPUs
Viaarxiv icon

ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis

Add code
Jul 20, 2022
Figure 1 for ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis
Figure 2 for ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis
Figure 3 for ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis
Figure 4 for ApHMM: Accelerating Profile Hidden Markov Models for Fast and Energy-Efficient Genome Analysis
Viaarxiv icon

Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video

Add code
May 19, 2022
Figure 1 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video
Figure 2 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video
Figure 3 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video
Figure 4 for Unsupervised Learning of Depth, Camera Pose and Optical Flow from Monocular Video
Viaarxiv icon

Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion

Add code
Nov 16, 2021
Figure 1 for Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion
Figure 2 for Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion
Figure 3 for Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion
Figure 4 for Robust 3D Scene Segmentation through Hierarchical and Learnable Part-Fusion
Viaarxiv icon

Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning

Add code
Oct 19, 2021
Figure 1 for Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning
Figure 2 for Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning
Figure 3 for Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning
Figure 4 for Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning
Viaarxiv icon

RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU

Add code
Oct 05, 2021
Figure 1 for RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Figure 2 for RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Figure 3 for RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Figure 4 for RASA: Efficient Register-Aware Systolic Array Matrix Engine for CPU
Viaarxiv icon