Ismail Elezi

Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering

Mar 13, 2026
Via arXiv

Test-Time Scaling with Diffusion Language Models via Reward-Guided Stitching

Feb 26, 2026
Via arXiv

A Benchmark for Deep Information Synthesis

Feb 24, 2026
Via arXiv

Top 10 Open Challenges Steering the Future of Diffusion Language Model and Its Variants

Jan 20, 2026
Via arXiv

SATGround: A Spatially-Aware Approach for Visual Grounding in Remote Sensing

Dec 09, 2025
Via arXiv

RetouchLLM: Training-free White-box Image Retouching

Oct 09, 2025
Via arXiv

Region-based Cluster Discrimination for Visual Representation Learning

Jul 26, 2025
Via arXiv

"Principal Components" Enable A New Language of Images

Mar 11, 2025
Via arXiv

From Attention to Activation: Unravelling the Enigmas of Large Language Models

Oct 22, 2024
Via arXiv

Fractal Calibration for long-tailed object detection

Oct 15, 2024
Via arXiv