Picture for Chao Shen

Chao Shen

AutoEmpirical: LLM-Based Automated Research for Empirical Software Fault Analysis

Add code
Oct 06, 2025
Viaarxiv icon

JADES: A Universal Framework for Jailbreak Assessment via Decompositional Scoring

Add code
Aug 28, 2025
Viaarxiv icon

Adversarial Video Promotion Against Text-to-Video Retrieval

Add code
Aug 12, 2025
Viaarxiv icon

D3: Training-Free AI-Generated Video Detection Using Second-Order Features

Add code
Aug 01, 2025
Viaarxiv icon

Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights

Add code
Aug 01, 2025
Figure 1 for Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Figure 2 for Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Figure 3 for Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Figure 4 for Revisiting Adversarial Patch Defenses on Object Detectors: Unified Evaluation, Large-Scale Dataset, and New Insights
Viaarxiv icon

Concept Unlearning by Modeling Key Steps of Diffusion Process

Add code
Jul 09, 2025
Viaarxiv icon

The Foundation Cracks: A Comprehensive Study on Bugs and Testing Practices in LLM Libraries

Add code
Jun 14, 2025
Viaarxiv icon

Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations

Add code
May 23, 2025
Viaarxiv icon

MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models

Add code
May 22, 2025
Figure 1 for MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
Figure 2 for MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
Figure 3 for MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
Figure 4 for MCP-RADAR: A Multi-Dimensional Benchmark for Evaluating Tool Use Capabilities in Large Language Models
Viaarxiv icon

DeepEyes: Incentivizing "Thinking with Images" via Reinforcement Learning

Add code
May 20, 2025
Viaarxiv icon