Shu Yang

Benchmarking and Mitigate Psychological Sycophancy in Medical Vision-Language Models

Sep 26, 2025
Via arXiv

Rate doubly robust estimation for weighted average treatment effects

Sep 18, 2025

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Aug 18, 2025

Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Jul 22, 2025

Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs

Jun 24, 2025

The Compositional Architecture of Regret in Large Language Models

Jun 18, 2025

Flattery in Motion: Benchmarking and Analyzing Sycophancy in Video-LLMs

Jun 08, 2025

Mitigating Behavioral Hallucination in Multimodal Large Language Models for Sequential Images

Jun 08, 2025

Stable Vision Concept Transformers for Medical Diagnosis

Jun 05, 2025

Understanding How Value Neurons Shape the Generation of Specified Values in LLMs

May 23, 2025