Ismini Lourentzou

VTAM: Video-Tactile-Action Models for Complex Physical Interaction Beyond VLAs

Mar 24, 2026
Via arXiv

EgoForge: Goal-Directed Egocentric World Simulator

Mar 20, 2026
Via arXiv

DreamPartGen: Semantically Grounded Part-Level 3D Generation via Collaborative Latent Denoising

Mar 19, 2026
Via arXiv

Best of Both Worlds: Multimodal Reasoning and Generation via Unified Discrete Flow Matching

Feb 12, 2026
Via arXiv

FASA: Frequency-aware Sparse Attention

Feb 03, 2026
Via arXiv

Toward Cognitive Supersensing in Multimodal Large Language Model

Feb 02, 2026
Via arXiv

PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

Jan 22, 2026
Via arXiv

PALM: Progress-Aware Policy Learning via Affordance Reasoning for Long-Horizon Robotic Manipulation

Jan 11, 2026
Via arXiv

Hierarchical Dataset Selection for High-Quality Data Sharing

Dec 24, 2025
Via arXiv

CoRe3D: Collaborative Reasoning as a Foundation for 3D Intelligence

Dec 14, 2025
Via arXiv