Picture for Chengzhi Mao

Chengzhi Mao

Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification

Add code
Dec 18, 2025
Figure 1 for Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Figure 2 for Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Figure 3 for Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Figure 4 for Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification
Viaarxiv icon

Mull-Tokens: Modality-Agnostic Latent Thinking

Add code
Dec 11, 2025
Figure 1 for Mull-Tokens: Modality-Agnostic Latent Thinking
Figure 2 for Mull-Tokens: Modality-Agnostic Latent Thinking
Figure 3 for Mull-Tokens: Modality-Agnostic Latent Thinking
Figure 4 for Mull-Tokens: Modality-Agnostic Latent Thinking
Viaarxiv icon

LARGO: Latent Adversarial Reflection through Gradient Optimization for Jailbreaking LLMs

Add code
May 16, 2025
Viaarxiv icon

LAVID: An Agentic LVLM Framework for Diffusion-Generated Video Detection

Add code
Feb 20, 2025
Viaarxiv icon

Diversity Helps Jailbreak Large Language Models

Add code
Nov 06, 2024
Figure 1 for Diversity Helps Jailbreak Large Language Models
Figure 2 for Diversity Helps Jailbreak Large Language Models
Figure 3 for Diversity Helps Jailbreak Large Language Models
Figure 4 for Diversity Helps Jailbreak Large Language Models
Viaarxiv icon

I Can Hear You: Selective Robust Training for Deepfake Audio Detection

Add code
Oct 31, 2024
Figure 1 for I Can Hear You: Selective Robust Training for Deepfake Audio Detection
Figure 2 for I Can Hear You: Selective Robust Training for Deepfake Audio Detection
Figure 3 for I Can Hear You: Selective Robust Training for Deepfake Audio Detection
Figure 4 for I Can Hear You: Selective Robust Training for Deepfake Audio Detection
Viaarxiv icon

SPIN: Self-Supervised Prompt INjection

Add code
Oct 17, 2024
Figure 1 for SPIN: Self-Supervised Prompt INjection
Figure 2 for SPIN: Self-Supervised Prompt INjection
Figure 3 for SPIN: Self-Supervised Prompt INjection
Figure 4 for SPIN: Self-Supervised Prompt INjection
Viaarxiv icon

RAFT: Realistic Attacks to Fool Text Detectors

Add code
Oct 04, 2024
Figure 1 for RAFT: Realistic Attacks to Fool Text Detectors
Figure 2 for RAFT: Realistic Attacks to Fool Text Detectors
Figure 3 for RAFT: Realistic Attacks to Fool Text Detectors
Figure 4 for RAFT: Realistic Attacks to Fool Text Detectors
Viaarxiv icon

Learning to Rewrite: Generalized LLM-Generated Text Detection

Add code
Aug 08, 2024
Figure 1 for Learning to Rewrite: Generalized LLM-Generated Text Detection
Figure 2 for Learning to Rewrite: Generalized LLM-Generated Text Detection
Figure 3 for Learning to Rewrite: Generalized LLM-Generated Text Detection
Figure 4 for Learning to Rewrite: Generalized LLM-Generated Text Detection
Viaarxiv icon

Turns Out I'm Not Real: Towards Robust Detection of AI-Generated Videos

Add code
Jun 13, 2024
Viaarxiv icon