Rachid Karami

MOSAIC: Efficient Mixture-of-Agent Scheduling via Adaptive Aggregation and Inference Concurrency

Jun 02, 2026
Via arXiv

Characterizing State Space Model (SSM) and SSM-Transformer Hybrid Language Model Performance with Long Context Length

Jul 16, 2025
Via arXiv

BF-IMNA: A Bit Fluid In-Memory Neural Architecture for Neural Network Acceleration

Nov 03, 2024
Via arXiv

NonGEMM Bench: Understanding the Performance Horizon of the Latest ML Workloads with NonGEMM Workloads

Apr 17, 2024
Via arXiv