Picture for Meng Han

Meng Han

MEraser: An Effective Fingerprint Erasure Approach for Large Language Models

Add code
Jun 14, 2025
Viaarxiv icon

Pushing the Limits of Safety: A Technical Report on the ATLAS Challenge 2025

Add code
Jun 14, 2025
Viaarxiv icon

ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

Add code
Jun 12, 2025
Viaarxiv icon

Direct Behavior Optimization: Unlocking the Potential of Lightweight LLMs

Add code
Jun 06, 2025
Viaarxiv icon

CaMDN: Enhancing Cache Efficiency for Multi-tenant DNNs on Integrated NPUs

Add code
May 10, 2025
Viaarxiv icon

NeuRel-Attack: Neuron Relearning for Safety Disalignment in Large Language Models

Add code
Apr 29, 2025
Viaarxiv icon

FineQ: Software-Hardware Co-Design for Low-Bit Fine-Grained Mixed-Precision Quantization of LLMs

Add code
Apr 28, 2025
Viaarxiv icon

Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks

Add code
Feb 18, 2025
Viaarxiv icon

CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models

Add code
Nov 20, 2024
Figure 1 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 2 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 3 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Figure 4 for CopyrightMeter: Revisiting Copyright Protection in Text-to-image Models
Viaarxiv icon

GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks

Add code
Sep 29, 2024
Figure 1 for GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
Figure 2 for GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
Figure 3 for GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
Figure 4 for GenTel-Safe: A Unified Benchmark and Shielding Framework for Defending Against Prompt Injection Attacks
Viaarxiv icon