Picture for Xiaohan Yuan

Xiaohan Yuan

RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments

Add code
Jun 18, 2025
Viaarxiv icon

PRUNE: A Patching Based Repair Framework for Certiffable Unlearning of Neural Networks

Add code
May 10, 2025
Viaarxiv icon

S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models

Add code
May 28, 2024
Viaarxiv icon

4D Myocardium Reconstruction with Decoupled Motion and Shape Model

Add code
Aug 27, 2023
Viaarxiv icon

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

Add code
Apr 26, 2023
Viaarxiv icon