Dimitrios S. Nikolopoulos

WISP: Waste- and Interference-Suppressed Distributed Speculative LLM Serving at the Edge via Dynamic Drafting and SLO-Aware Batching

Jan 15, 2026
Via arXiv

DiffPro: Joint Timestep and Layer-Wise Precision Optimization for Efficient Diffusion Inference

Nov 14, 2025
Via arXiv

MARCO: A Multi-Agent System for Optimizing HPC Code Generation Using Large Language Models

May 06, 2025
Via arXiv

HiRED: Attention-Guided Token Dropping for Efficient Inference of High-Resolution Vision-Language Models in Resource-Constrained Environments

Aug 20, 2024
Via arXiv