Zhun Zhong

TDEdit: A Unified Diffusion Framework for Text-Drag Guided Image Manipulation

Sep 26, 2025
Via arXiv

Beyond Artificial Misalignment: Detecting and Grounding Semantic-Coordinated Multimodal Manipulations

Sep 16, 2025
Via arXiv

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition

Jun 15, 2025
Via arXiv

Towards Micro-Action Recognition with Limited Annotations: An Asynchronous Pseudo Labeling and Training Approach

Apr 10, 2025
Via arXiv

ATM-Net: Anatomy-Aware Text-Guided Multi-Modal Fusion for Fine-Grained Lumbar Spine Segmentation

Apr 04, 2025
Via arXiv

Noisy Test-Time Adaptation in Vision-Language Models

Feb 20, 2025
Via arXiv

Prior-Constrained Association Learning for Fine-Grained Generalized Category Discovery

Feb 13, 2025
Via arXiv

Knowledge Swapping via Learning and Unlearning

Feb 12, 2025
Via arXiv

Multi-Modality Driven LoRA for Adverse Condition Depth Estimation

Dec 28, 2024
Via arXiv

ChangeDiff: A Multi-Temporal Change Detection Data Generator with Flexible Text Prompts via Diffusion Model

Dec 20, 2024
Via arXiv