Mohsen Imani

TorchTraceAP: A New Benchmark Dataset for Detecting Performance Anti-Patterns in Computer Vision Models

Dec 16, 2025
Via arXiv

Are Hypervectors Enough? Single-Call LLM Reasoning over Knowledge Graphs

Dec 10, 2025
Via arXiv

Cauchy-Schwarz Fairness Regularizer

Dec 10, 2025
Via arXiv

LUNE: Efficient LLM Unlearning via LoRA Fine-Tuning with Negative Examples

Dec 08, 2025
Via arXiv

Recover-to-Forget: Gradient Reconstruction from LoRA for Efficient LLM Unlearning

Dec 08, 2025
Via arXiv

Mitigating Bias in Graph Hyperdimensional Computing

Dec 08, 2025
Via arXiv

T-SAR: A Full-Stack Co-design for CPU-Only Ternary LLM Inference via In-Place SIMD ALU Reorganization

Nov 17, 2025
Via arXiv

QUILL: An Algorithm-Architecture Co-Design for Cache-Local Deformable Attention

Nov 17, 2025
Via arXiv

Draft and Refine with Visual Experts

Nov 14, 2025
Via arXiv

ASTER: Attention-based Spiking Transformer Engine for Event-driven Reasoning

Nov 10, 2025
Via arXiv