Han Qiu

Exploring Multimodal Challenges in Toxic Chinese Detection: Taxonomy, Benchmark, and Findings

May 30, 2025
Via arXiv

BitHydra: Towards Bit-flip Inference Cost Attack against Large Language Models

May 22, 2025
Via arXiv

ShieldVLM: Safeguarding the Multimodal Implicit Toxicity via Deliberative Reasoning with LVLMs

May 20, 2025
Via arXiv

Holmes: Automated Fact Check with Large Language Models

May 06, 2025
Via arXiv

Mask Image Watermarking

Apr 17, 2025
Via arXiv

FaceID-6M: A Large-Scale, Open-Source FaceID Customization Dataset

Mar 11, 2025
Via arXiv

Picky LLMs and Unreliable RMs: An Empirical Study on Safety Alignment after Instruction Tuning

Feb 03, 2025
Via arXiv

VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking

Jan 24, 2025
Via arXiv

An Engorgio Prompt Makes Large Language Model Babble on

Dec 27, 2024
Via arXiv

Understanding the Dark Side of LLMs' Intrinsic Self-Correction

Dec 19, 2024
Via arXiv