Alert button
Picture for Qihang Fan

Qihang Fan

Alert button

Semantic Equitable Clustering: A Simple, Fast and Effective Strategy for Vision Transformer

Add code
Bookmark button
Alert button
May 22, 2024
Qihang Fan, Huaibo Huang, Mingrui Chen, Ran He

Viaarxiv icon

Vision Transformer with Sparse Scan Prior

Add code
Bookmark button
Alert button
May 22, 2024
Qihang Fan, Huaibo Huang, Mingrui Chen, Ran He

Viaarxiv icon

Band-Attention Modulated RetNet for Face Forgery Detection

Add code
Bookmark button
Alert button
Apr 09, 2024
Zhida Zhang, Jie Cao, Wenkui Yang, Qihang Fan, Kai Zhou, Ran He

Viaarxiv icon

ViTAR: Vision Transformer with Any Resolution

Add code
Bookmark button
Alert button
Mar 28, 2024
Qihang Fan, Quanzeng You, Xiaotian Han, Yongfei Liu, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Figure 1 for ViTAR: Vision Transformer with Any Resolution
Figure 2 for ViTAR: Vision Transformer with Any Resolution
Figure 3 for ViTAR: Vision Transformer with Any Resolution
Figure 4 for ViTAR: Vision Transformer with Any Resolution
Viaarxiv icon

Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling

Add code
Bookmark button
Alert button
Oct 11, 2023
Haogeng Liu, Qihang Fan, Tingkai Liu, Linjie Yang, Yunzhe Tao, Huaibo Huang, Ran He, Hongxia Yang

Figure 1 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 2 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 3 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Figure 4 for Video-Teller: Enhancing Cross-Modal Generation with Fusion and Decoupling
Viaarxiv icon

Video-CSR: Complex Video Digest Creation for Visual-Language Models

Add code
Bookmark button
Alert button
Oct 08, 2023
Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fan, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang

Figure 1 for Video-CSR: Complex Video Digest Creation for Visual-Language Models
Figure 2 for Video-CSR: Complex Video Digest Creation for Visual-Language Models
Figure 3 for Video-CSR: Complex Video Digest Creation for Visual-Language Models
Figure 4 for Video-CSR: Complex Video Digest Creation for Visual-Language Models
Viaarxiv icon

RMT: Retentive Networks Meet Vision Transformers

Add code
Bookmark button
Alert button
Sep 20, 2023
Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He

Figure 1 for RMT: Retentive Networks Meet Vision Transformers
Figure 2 for RMT: Retentive Networks Meet Vision Transformers
Figure 3 for RMT: Retentive Networks Meet Vision Transformers
Figure 4 for RMT: Retentive Networks Meet Vision Transformers
Viaarxiv icon

Lightweight Vision Transformer with Bidirectional Interaction

Add code
Bookmark button
Alert button
Jun 01, 2023
Qihang Fan, Huaibo Huang, Xiaoqiang Zhou, Ran He

Figure 1 for Lightweight Vision Transformer with Bidirectional Interaction
Figure 2 for Lightweight Vision Transformer with Bidirectional Interaction
Figure 3 for Lightweight Vision Transformer with Bidirectional Interaction
Figure 4 for Lightweight Vision Transformer with Bidirectional Interaction
Viaarxiv icon

Rethinking Local Perception in Lightweight Vision Transformer

Add code
Bookmark button
Alert button
Apr 03, 2023
Qihang Fan, Huaibo Huang, Jiyang Guan, Ran He

Figure 1 for Rethinking Local Perception in Lightweight Vision Transformer
Figure 2 for Rethinking Local Perception in Lightweight Vision Transformer
Figure 3 for Rethinking Local Perception in Lightweight Vision Transformer
Figure 4 for Rethinking Local Perception in Lightweight Vision Transformer
Viaarxiv icon