Picture for Huanran Chen

Huanran Chen

ClawWorm: Self-Propagating Attacks Across LLM Agent Ecosystems

Add code
Mar 16, 2026
Viaarxiv icon

Mitigating Overthinking in Large Reasoning Models via Manifold Steering

Add code
May 28, 2025
Viaarxiv icon

Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives

Add code
May 23, 2025
Figure 1 for Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Figure 2 for Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Figure 3 for Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Figure 4 for Understanding Pre-training and Fine-tuning from Loss Landscape Perspectives
Viaarxiv icon

Towards the Worst-case Robustness of Large Language Models

Add code
Jan 31, 2025
Figure 1 for Towards the Worst-case Robustness of Large Language Models
Figure 2 for Towards the Worst-case Robustness of Large Language Models
Figure 3 for Towards the Worst-case Robustness of Large Language Models
Figure 4 for Towards the Worst-case Robustness of Large Language Models
Viaarxiv icon

Scaling Laws for Black box Adversarial Attacks

Add code
Nov 25, 2024
Figure 1 for Scaling Laws for Black box Adversarial Attacks
Figure 2 for Scaling Laws for Black box Adversarial Attacks
Figure 3 for Scaling Laws for Black box Adversarial Attacks
Figure 4 for Scaling Laws for Black box Adversarial Attacks
Viaarxiv icon

ADBM: Adversarial diffusion bridge model for reliable adversarial purification

Add code
Aug 01, 2024
Figure 1 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 2 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 3 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Figure 4 for ADBM: Adversarial diffusion bridge model for reliable adversarial purification
Viaarxiv icon

Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study

Add code
Jun 11, 2024
Figure 1 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 2 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 3 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Figure 4 for Benchmarking Trustworthiness of Multimodal Large Language Models: A Comprehensive Study
Viaarxiv icon

Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy

Add code
May 23, 2024
Figure 1 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 2 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 3 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Figure 4 for Membership Inference on Text-to-Image Diffusion Models via Conditional Likelihood Discrepancy
Viaarxiv icon

Elucidating the Design Space of Dataset Condensation

Add code
Apr 21, 2024
Figure 1 for Elucidating the Design Space of Dataset Condensation
Figure 2 for Elucidating the Design Space of Dataset Condensation
Figure 3 for Elucidating the Design Space of Dataset Condensation
Figure 4 for Elucidating the Design Space of Dataset Condensation
Viaarxiv icon

On the Duality Between Sharpness-Aware Minimization and Adversarial Training

Add code
Feb 23, 2024
Figure 1 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 2 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 3 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Figure 4 for On the Duality Between Sharpness-Aware Minimization and Adversarial Training
Viaarxiv icon