Picture for Tan Yu

Tan Yu

RAP: Real-time Audio-driven Portrait Animation with Video Diffusion Transformer

Add code
Aug 07, 2025
Viaarxiv icon

Marrying Autoregressive Transformer and Diffusion with Multi-Reference Autoregression

Add code
Jun 11, 2025
Viaarxiv icon

UniGuardian: A Unified Defense for Detecting Prompt Injection, Backdoor Attacks and Adversarial Attacks in Large Language Models

Add code
Feb 18, 2025
Viaarxiv icon

KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing

Add code
Dec 27, 2024
Figure 1 for KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Figure 2 for KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Figure 3 for KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Figure 4 for KALAHash: Knowledge-Anchored Low-Resource Adaptation for Deep Hashing
Viaarxiv icon

In Defense of RAG in the Era of Long-Context Language Models

Add code
Sep 03, 2024
Viaarxiv icon

FACTS About Building Retrieval Augmented Generation-based Chatbots

Add code
Jul 10, 2024
Viaarxiv icon

Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration

Add code
Mar 17, 2024
Figure 1 for Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration
Figure 2 for Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration
Figure 3 for Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration
Figure 4 for Reconstruct before Query: Continual Missing Modality Learning with Decomposed Prompt Collaboration
Viaarxiv icon

Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations

Add code
Nov 25, 2022
Figure 1 for Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Figure 2 for Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Figure 3 for Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Figure 4 for Degenerate Swin to Win: Plain Window-based Transformer without Sophisticated Operations
Viaarxiv icon

R2-MLP: Round-Roll MLP for Multi-View 3D Object Recognition

Add code
Nov 20, 2022
Viaarxiv icon

Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models

Add code
Oct 19, 2022
Figure 1 for Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models
Figure 2 for Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models
Figure 3 for Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models
Figure 4 for Prompting through Prototype: A Prototype-based Prompt Learning on Pretrained Vision-Language Models
Viaarxiv icon