Elmar Rückert

ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers

May 26, 2025
Via arXiv