Mohamed Elhoseiny

Artic-O: End-to-End Articulated Object Reconstruction via Latent Geometry Learning

Jun 20, 2026

Beyond Semantics: Modeling Factual and Affective Perceptual Experiences from Vision-Language Data

Jun 02, 2026

A Shared Valence Axis Across Modern LLMs and Human EEG: The Saturation Regularity

May 28, 2026

Afford-VLA: Action-Aligned Visual Planning via Internalized Affordance

May 22, 2026

CompoSE: Compositional Synthesis and Editing of 3D Shapes via Part-Aware Control

May 19, 2026

The Second Challenge on Cross-Domain Few-Shot Object Detection at NTIRE 2026: Methods and Results

Apr 13, 2026

Small Vision-Language Models are Smart Compressors for Long Video Understanding

Apr 09, 2026

M-MiniGPT4: Multilingual VLLM Alignment via Translated Data

Mar 31, 2026

dTRPO: Trajectory Reduction in Policy Optimization of Diffusion Large Language Models

Mar 19, 2026

InfinityStory: Unlimited Video Generation with World Consistency and Character-Aware Shot Transitions

Mar 04, 2026