Xiaoling Wang

Hierarchical Safety Realignment: Lightweight Restoration of Safety in Pruned Large Vision-Language Models

May 22, 2025
Via arXiv

AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation

May 17, 2025
Via arXiv

Unified Attacks to Large Language Model Watermarks: Spoofing and Scrubbing in Unauthorized Knowledge Distillation

Apr 24, 2025
Via arXiv

HSACNet: Hierarchical Scale-Aware Consistency Regularized Semi-Supervised Change Detection

Apr 18, 2025
Via arXiv

Latent-space adversarial training with post-aware calibration for defending large language models against jailbreak attacks

Jan 18, 2025
Via arXiv

Hierarchical Divide-and-Conquer for Fine-Grained Alignment in LLM-Based Medical Evaluation

Jan 12, 2025
Via arXiv

NLSR: Neuron-Level Safety Realignment of Large Language Models Against Harmful Fine-Tuning

Dec 17, 2024
Via arXiv

ACE-$M^3$: Automatic Capability Evaluator for Multimodal Medical Models

Dec 16, 2024
Via arXiv

Typicalness-Aware Learning for Failure Detection

Nov 04, 2024
Via arXiv

Online Convex Optimization with Memory and Limited Predictions

Oct 31, 2024
Via arXiv