Elmar Rückert

ViTaPEs: Visuotactile Position Encodings for Cross-Modal Alignment in Multimodal Transformers

May 26, 2025
Via arXiv