Picture for Huanran Chen

Huanran Chen

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Add code
May 28, 2025
Viaarxiv icon

Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives

Add code
May 23, 2025
Viaarxiv icon

Towards the Worst-case Robustness of Large Language Models

Add code
Jan 31, 2025
Figure 1 for Towards the Worst-case Robustness of Large Language Models
Figure 2 for Towards the Worst-case Robustness of Large Language Models
Figure 3 for Towards the Worst-case Robustness of Large Language Models
Figure 4 for Towards the Worst-case Robustness of Large Language Models
Viaarxiv icon

Scaling Laws for Black box Adversarial Attacks

Add code
Nov 25, 2024
Figure 1 for Scaling Laws for Black box Adversarial Attacks
Figure 2 for Scaling Laws for Black box Adversarial Attacks
Figure 3 for Scaling Laws for Black box Adversarial Attacks
Figure 4 for Scaling Laws for Black box Adversarial Attacks
Viaarxiv icon

ADBM: Adversarial diffusion bridge model for reliable adversarial purification

Add code
Aug 01, 2024
Figure 1 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 2 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 3 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 4 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Viaarxiv icon

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Add code
Jun 11, 2024
Figure 1 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 2 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 3 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 4 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Viaarxiv icon

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Add code
May 23, 2024
Figure 1 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 2 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 3 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 4 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Viaarxiv icon

Elucidating the Design Space of Dataset Condensation

Add code
Apr 21, 2024
Figure 1 for Elucidating the Design Space of Dataset Condensation
Figure 2 for Elucidating the Design Space of Dataset Condensation
Figure 3 for Elucidating the Design Space of Dataset Condensation
Figure 4 for Elucidating the Design Space of Dataset Condensation
Viaarxiv icon

On the Duality Between Sharpness-Aware Minimization and Adversarial Training

Add code
Feb 23, 2024
Figure 1 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 2 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 3 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 4 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Viaarxiv icon

Your Diffusion Model is Secretly a Certifiably Robust Classifier

Add code
Feb 13, 2024
Viaarxiv icon