Shu Yang

AutoMonitor-Bench: Evaluating the Reliability of LLM-Based Misbehavior Monitor

Jan 09, 2026
Via arXiv

MambaMIL+: Modeling Long-Term Contextual Patterns for Gigapixel Whole Slide Image

Dec 19, 2025
Via arXiv

Investigating CoT Monitorability in Large Reasoning Models

Nov 13, 2025
Via arXiv

MONICA: Real-Time Monitoring and Calibration of Chain-of-Thought Sycophancy in Large Reasoning Models

Nov 09, 2025
Via arXiv

GenAR: Next-Scale Autoregressive Generation for Spatial Gene Expression Prediction

Oct 05, 2025
Via arXiv

Benchmarking and Mitigate Psychological Sycophancy in Medical Vision-Language Models

Sep 26, 2025
Via arXiv

Rate doubly robust estimation for weighted average treatment effects

Sep 18, 2025
Via arXiv

RepreGuard: Detecting LLM-Generated Text by Revealing Hidden Representation Patterns

Aug 18, 2025
Via arXiv

Comparative validation of surgical phase recognition, instrument keypoint estimation, and instrument instance segmentation in endoscopy: Results of the PhaKIR 2024 challenge

Jul 22, 2025
Via arXiv

Is Long-to-Short a Free Lunch? Investigating Inconsistency and Reasoning Efficiency in LRMs

Jun 24, 2025
Via arXiv