Qihang Fan

Vision Transformer with Sparse Scan Prior

May 22, 2024
Via arXiv

Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer

May 22, 2024
Via arXiv

Band-Attention Modulated RetNet for Face Forgery Detection

Apr 09, 2024
Via arXiv

ViTAR: Vision Transformer with Any Resolution

Mar 28, 2024
Via arXiv

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Oct 11, 2023
Via arXiv

Video-CSR: Complex Video Digest Creation for Visual-Language Models

Oct 08, 2023
Via arXiv

RMT: Retentive Networks Meet Vision Transformers

Sep 20, 2023
Via arXiv

Lightweight Vision Transformer with Bidirectional Interaction

Jun 01, 2023
Via arXiv

Rethinking Local Perception in Lightweight Vision Transformer

Apr 03, 2023
Via arXiv