Ngai-Man Cheung

How Do Medical MLLMs Fail? A Study on Visual Grounding in Medical Images

Mar 15, 2026
Via arXiv

On the Adversarial Robustness of 3D Large Vision-Language Models

Jan 10, 2026
Via arXiv

Towards Sustainable Universal Deepfake Detection with Frequency-Domain Masking

Dec 08, 2025
Via arXiv

Model Inversion Attacks on Vision-Language Models: Do They Leak What They Learn?

Aug 06, 2025
Via arXiv

AIR: Zero-shot Generative Model Adaptation with Iterative Refinement

Jun 12, 2025
Via arXiv

Uncovering the Limitations of Model Inversion Evaluation -- Benchmarks and Connection to Type-I Adversarial Attacks

May 08, 2025
Via arXiv

Text to Image Generation and Editing: A Survey

May 05, 2025
Via arXiv

Vision Transformer Neural Architecture Search for Out-of-Distribution Generalization: Benchmark and Insights

Jan 07, 2025
Via arXiv

Multimodal Preference Data Synthetic Alignment with Reward Model

Dec 23, 2024
Via arXiv

Urban Air Temperature Prediction using Conditional Diffusion Models

Dec 18, 2024
Via arXiv