Picture for Jiaxu Feng

Jiaxu Feng

A 2D Semantic-Aware Position Encoding for Vision Transformers

Add code
May 14, 2025
Figure 1 for A 2D Semantic-Aware Position Encoding for Vision Transformers
Figure 2 for A 2D Semantic-Aware Position Encoding for Vision Transformers
Figure 3 for A 2D Semantic-Aware Position Encoding for Vision Transformers
Figure 4 for A 2D Semantic-Aware Position Encoding for Vision Transformers
Viaarxiv icon