Yejun Tang

ContextDrag: Precise Drag-Based Image Editing via Context-Preserving Token Injection and Position-Consistent Attention

Dec 09, 2025

A Bilingual, OpenWorld Video Text Dataset and End-to-end Video Text Spotter with Transformer

Dec 09, 2021