Picture for Youting Wang

Youting Wang

Self-Commitment Latency: A Reward-Free Probe for Prompted Implicit Hacking

Add code
Jun 04, 2026
Viaarxiv icon

When LLM Reward Design Fails: Diagnostic-Driven Refinement for Sparse Structured RL

Add code
May 27, 2026
Viaarxiv icon

VisualLeakBench: Auditing the Fragility of Large Vision-Language Models against PII Leakage and Social Engineering

Add code
Mar 11, 2026
Viaarxiv icon

Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud Detection

Add code
Aug 05, 2021
Figure 1 for Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud Detection
Figure 2 for Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud Detection
Figure 3 for Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud Detection
Figure 4 for Locally Interpretable One-Class Anomaly Detection for Credit Card Fraud Detection
Viaarxiv icon