Chia-Mu Yu

Hidden Thoughts Are Not Secret: Reasoning Trace Exposure in LLMs

May 30, 2026

Harmless Yet Harmful: Neutral Prompting Attacks for Stealthy Hallucination Steering in Agent Skills

May 28, 2026

IU: Imperceptible Universal Backdoor Attack

Feb 28, 2026

Defending Unauthorized Model Merging via Dual-Stage Weight Protection

Nov 14, 2025

VP-NTK: Exploring the Benefits of Visual Prompting in Differentially Private Data Synthesis

Mar 20, 2025

Data Poisoning Attacks to Locally Differentially Private Range Query Protocols

Mar 05, 2025

Beyond Natural Language Perplexity: Detecting Dead Code Poisoning in Code Generation Datasets

Feb 28, 2025

Layer-Aware Task Arithmetic: Disentangling Task-Specific and Instruction-Following Knowledge

Feb 27, 2025

BADTV: Unveiling Backdoor Threats in Third-Party Task Vectors

Jan 04, 2025

Prompting the Unseen: Detecting Hidden Backdoors in Black-Box Models

Nov 14, 2024