Picture for Mengxuan Hu

Mengxuan Hu

Benign Samples Matter! Fine-tuning On Outlier Benign Samples Severely Breaks Safety

Add code
May 11, 2025
Viaarxiv icon

BalancEdit: Dynamically Balancing the Generality-Locality Trade-off in Multi-modal Model Editing

Add code
May 02, 2025
Viaarxiv icon

Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing

Add code
Oct 23, 2024
Figure 1 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 2 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 3 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Figure 4 for Backdoor in Seconds: Unlocking Vulnerabilities in Large Pre-trained Models via Model Editing
Viaarxiv icon

No Free Lunch: Retrieval-Augmented Generation Undermines Fairness in LLMs, Even for Vigilant Users

Add code
Oct 10, 2024
Viaarxiv icon

Causal Inference with Latent Variables: Recent Advances and Future Prospectives

Add code
Jun 20, 2024
Viaarxiv icon

UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models

Add code
Apr 01, 2024
Figure 1 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 2 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 3 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Figure 4 for UFID: A Unified Framework for Input-level Backdoor Detection on Diffusion Models
Viaarxiv icon

Img2Loc: Revisiting Image Geolocalization using Multi-modality Foundation Models and Image-based Retrieval-Augmented Generation

Add code
Mar 28, 2024
Viaarxiv icon

Bridging Causal Discovery and Large Language Models: A Comprehensive Survey of Integrative Approaches and Future Directions

Add code
Feb 16, 2024
Viaarxiv icon

Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction

Add code
Dec 20, 2023
Figure 1 for Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Figure 2 for Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Figure 3 for Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Figure 4 for Task-Driven Causal Feature Distillation: Towards Trustworthy Risk Prediction
Viaarxiv icon

XAI meets Biology: A Comprehensive Review of Explainable AI in Bioinformatics Applications

Add code
Dec 11, 2023
Viaarxiv icon