Picture for Zongjie Li

Zongjie Li

Beyond Content Safety: Real-Time Monitoring for Reasoning Vulnerabilities in Large Language Models

Add code
Mar 26, 2026
Viaarxiv icon

WARBENCH: A Comprehensive Benchmark for Evaluating LLMs in Military Decision-Making

Add code
Mar 22, 2026
Viaarxiv icon

On Protecting Agentic Systems' Intellectual Property via Watermarking

Add code
Feb 09, 2026
Viaarxiv icon

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks

Add code
Nov 19, 2025
Viaarxiv icon

Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning

Add code
Aug 27, 2025
Figure 1 for Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Figure 2 for Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Figure 3 for Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Figure 4 for Disabling Self-Correction in Retrieval-Augmented Generation via Stealthy Retriever Poisoning
Viaarxiv icon

SoK: Evaluating Jailbreak Guardrails for Large Language Models

Add code
Jun 12, 2025
Figure 1 for SoK: Evaluating Jailbreak Guardrails for Large Language Models
Figure 2 for SoK: Evaluating Jailbreak Guardrails for Large Language Models
Figure 3 for SoK: Evaluating Jailbreak Guardrails for Large Language Models
Figure 4 for SoK: Evaluating Jailbreak Guardrails for Large Language Models
Viaarxiv icon

Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models

Add code
Jun 11, 2025
Figure 1 for Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models
Figure 2 for Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models
Viaarxiv icon

IP Leakage Attacks Targeting LLM-Based Multi-Agent Systems

Add code
May 18, 2025
Viaarxiv icon

NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization

Add code
May 17, 2025
Figure 1 for NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
Figure 2 for NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
Figure 3 for NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
Figure 4 for NAMET: Robust Massive Model Editing via Noise-Aware Memory Optimization
Viaarxiv icon

GuidedBench: Equipping Jailbreak Evaluation with Guidelines

Add code
Feb 24, 2025
Figure 1 for GuidedBench: Equipping Jailbreak Evaluation with Guidelines
Figure 2 for GuidedBench: Equipping Jailbreak Evaluation with Guidelines
Figure 3 for GuidedBench: Equipping Jailbreak Evaluation with Guidelines
Figure 4 for GuidedBench: Equipping Jailbreak Evaluation with Guidelines
Viaarxiv icon