Picture for Ismini Lourentzou

Ismini Lourentzou

RewardFlow: Generate Images by Optimizing What You Reward

Add code
Apr 09, 2026
Viaarxiv icon

Phantom: Physics-Infused Video Generation via Joint Modeling of Visual and Latent Physical Dynamics

Add code
Apr 09, 2026
Viaarxiv icon

3D-VCD: Hallucination Mitigation in 3D-LLM Embodied Agents through Visual Contrastive Decoding

Add code
Apr 09, 2026
Viaarxiv icon

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

Add code
Mar 24, 2026
Viaarxiv icon

EgoForge: Goal-Directed Egocentric World Simulator

Add code
Mar 20, 2026
Viaarxiv icon

DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising

Add code
Mar 19, 2026
Viaarxiv icon

Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching

Add code
Feb 12, 2026
Viaarxiv icon

FASA: Frequency-aware Sparse Attention

Add code
Feb 03, 2026
Viaarxiv icon

Toward Cognitive Supersensing in Multimodal Large Language Model

Add code
Feb 02, 2026
Viaarxiv icon

PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

Add code
Jan 22, 2026
Viaarxiv icon