Picture for You Zhang

You Zhang

Medical Artificial Intelligence and Automation

WeDefense: A Toolkit to Defend Against Fake Audio

Add code
Jan 21, 2026
Viaarxiv icon

Exploiting DINOv3-Based Self-Supervised Features for Robust Few-Shot Medical Image Segmentation

Add code
Jan 12, 2026
Viaarxiv icon

How Does Instrumental Music Help SingFake Detection?

Add code
Sep 18, 2025
Viaarxiv icon

Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future

Add code
Jul 30, 2025
Figure 1 for Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future
Figure 2 for Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future
Figure 3 for Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future
Figure 4 for Segment Anything for Video: A Comprehensive Review of Video Object Segmentation and Tracking from Past to Future
Viaarxiv icon

Towards Perception-Informed Latent HRTF Representations

Add code
Jul 03, 2025
Viaarxiv icon

SAM-aware Test-time Adaptation for Universal Medical Image Segmentation

Add code
Jun 05, 2025
Figure 1 for SAM-aware Test-time Adaptation for Universal Medical Image Segmentation
Figure 2 for SAM-aware Test-time Adaptation for Universal Medical Image Segmentation
Figure 3 for SAM-aware Test-time Adaptation for Universal Medical Image Segmentation
Figure 4 for SAM-aware Test-time Adaptation for Universal Medical Image Segmentation
Viaarxiv icon

Time-resolved dynamic CBCT reconstruction using prior-model-free spatiotemporal Gaussian representation (PMF-STGR)

Add code
Mar 28, 2025
Viaarxiv icon

Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective

Add code
Mar 08, 2025
Figure 1 for Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective
Figure 2 for Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective
Figure 3 for Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective
Figure 4 for Multi-Attribute Multi-Grained Adaptation of Pre-Trained Language Models for Text Understanding from Bayesian Perspective
Viaarxiv icon

Audio Visual Segmentation Through Text Embeddings

Add code
Feb 22, 2025
Viaarxiv icon

ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech

Add code
Feb 13, 2025
Figure 1 for ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
Figure 2 for ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
Figure 3 for ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
Figure 4 for ASVspoof 5: Design, Collection and Validation of Resources for Spoofing, Deepfake, and Adversarial Attack Detection Using Crowdsourced Speech
Viaarxiv icon