Tong Jin

Vidi: Large Multimodal Models for Video Understanding and Editing

Apr 22, 2025
Via arXiv

SelaVPR++: Towards Seamless Adaptation of Foundation Models for Efficient Place Recognition

Feb 23, 2025
Via arXiv

EDTformer: An Efficient Decoder Transformer for Visual Place Recognition

Dec 01, 2024
Via arXiv