Picture for Dongchen Han

Dongchen Han

Linear-Time Global Visual Modeling without Explicit Attention

Add code
May 06, 2026
Viaarxiv icon

Linearizing Vision Transformer with Test-Time Training

Add code
May 04, 2026
Viaarxiv icon

SiameseNorm: Breaking the Barrier to Reconciling Pre/Post-Norm

Add code
Feb 08, 2026
Viaarxiv icon

LINA: Linear Autoregressive Image Generative Models with Continuous Tokens

Add code
Jan 30, 2026
Viaarxiv icon

Vision Transformers are Circulant Attention Learners

Add code
Dec 25, 2025
Viaarxiv icon

Step by Step Network

Add code
Nov 18, 2025
Viaarxiv icon

Bridging the Divide: Reconsidering Softmax and Linear Attention

Add code
Dec 09, 2024
Figure 1 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 2 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 3 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Figure 4 for Bridging the Divide: Reconsidering Softmax and Linear Attention
Viaarxiv icon

Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators

Add code
Aug 11, 2024
Figure 1 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 2 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 3 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Figure 4 for Efficient Diffusion Transformer with Step-wise Dynamic Attention Mediators
Viaarxiv icon

Demystify Mamba in Vision: A Linear Attention Perspective

Add code
May 26, 2024
Figure 1 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 2 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 3 for Demystify Mamba in Vision: A Linear Attention Perspective
Figure 4 for Demystify Mamba in Vision: A Linear Attention Perspective
Viaarxiv icon

VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models

Add code
Feb 21, 2024
Figure 1 for VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
Figure 2 for VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
Figure 3 for VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
Figure 4 for VL-Trojan: Multimodal Instruction Backdoor Attacks against Autoregressive Visual Language Models
Viaarxiv icon