Xiaoying Huang

StressEval: Failure-Driven Dynamic Benchmarking for Knowledge-Intensive Reasoning in Large Language Models

May 03, 2026
Via arXiv

AT-ADD: All-Type Audio Deepfake Detection Challenge Evaluation Plan

Apr 09, 2026
Via arXiv