Picture for Huijia Zhu

Huijia Zhu

COCO-Inpaint: A Benchmark for Image Inpainting Detection and Manipulation Localization

Add code
Apr 25, 2025
Viaarxiv icon

Towards Explainable Fake Image Detection with Multi-Modal Large Language Models

Add code
Apr 19, 2025
Viaarxiv icon

InterAnimate: Taming Region-aware Diffusion Model for Realistic Human Interaction Animation

Add code
Apr 15, 2025
Viaarxiv icon

Generalizable Audio Deepfake Detection via Latent Space Refinement and Augmentation

Add code
Jan 24, 2025
Viaarxiv icon

Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models

Add code
Jan 03, 2025
Figure 1 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 2 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 3 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Figure 4 for Auto-RT: Automatic Jailbreak Strategy Exploration for Red-Teaming Large Language Models
Viaarxiv icon

Maintain Plasticity in Long-timescale Continual Test-time Adaptation

Add code
Dec 28, 2024
Figure 1 for Maintain Plasticity in Long-timescale Continual Test-time Adaptation
Figure 2 for Maintain Plasticity in Long-timescale Continual Test-time Adaptation
Figure 3 for Maintain Plasticity in Long-timescale Continual Test-time Adaptation
Figure 4 for Maintain Plasticity in Long-timescale Continual Test-time Adaptation
Viaarxiv icon

DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning

Add code
Nov 07, 2024
Figure 1 for DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
Figure 2 for DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
Figure 3 for DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
Figure 4 for DomainGallery: Few-shot Domain-driven Image Generation by Attribute-centric Finetuning
Viaarxiv icon

Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding

Add code
Sep 29, 2024
Viaarxiv icon

Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training

Add code
Aug 30, 2024
Figure 1 for Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
Figure 2 for Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
Figure 3 for Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
Figure 4 for Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training
Viaarxiv icon

UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents

Add code
Aug 02, 2024
Figure 1 for UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
Figure 2 for UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
Figure 3 for UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
Figure 4 for UNER: A Unified Prediction Head for Named Entity Recognition in Visually-rich Documents
Viaarxiv icon