Picture for Quanchen Zou

Quanchen Zou

SafeHarbor: Hierarchical Memory-Augmented Guardrail for LLM Agent Safety

Add code
May 07, 2026
Viaarxiv icon

TrajShield: Trajectory-Level Safety Mediation for Defending Text-to-Video Models Against Jailbreak Attacks

Add code
May 03, 2026
Viaarxiv icon

Reading Between the Pixels: An Inscriptive Jailbreak Attack on Text-to-Image Models

Add code
Apr 07, 2026
Viaarxiv icon

Robust Privacy: Inference-Time Privacy through Certified Robustness

Add code
Jan 24, 2026
Viaarxiv icon

DIVER: Dynamic Iterative Visual Evidence Reasoning for Multimodal Fake News Detection

Add code
Jan 12, 2026
Viaarxiv icon

SAPL: Semantic-Agnostic Prompt Learning in CLIP for Weakly Supervised Image Manipulation Localization

Add code
Jan 09, 2026
Viaarxiv icon

RoboSafe: Safeguarding Embodied Agents via Executable Safety Logic

Add code
Dec 24, 2025
Viaarxiv icon

Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection

Add code
Dec 19, 2025
Figure 1 for Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection
Figure 2 for Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection
Figure 3 for Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection
Figure 4 for Disentangling Fact from Sentiment: A Dynamic Conflict-Consensus Framework for Multimodal Fake News Detection
Viaarxiv icon

VEIL: Jailbreaking Text-to-Video Models via Visual Exploitation from Implicit Language

Add code
Nov 17, 2025
Viaarxiv icon

Towards Understanding the Safety Boundaries of DeepSeek Models: Evaluation and Findings

Add code
Mar 19, 2025
Viaarxiv icon