Picture for Eric Wong

Eric Wong

Detecting Safety Violations Across Many Agent Traces

Add code
Apr 13, 2026
Viaarxiv icon

Detecting and Correcting Reference Hallucinations in Commercial LLMs and Deep Research Agents

Add code
Apr 03, 2026
Viaarxiv icon

Missingness Bias Calibration in Feature Attribution Explanations

Add code
Mar 05, 2026
Viaarxiv icon

CAMEL: An ECG Language Model for Forecasting Cardiac Events

Add code
Feb 17, 2026
Viaarxiv icon

Semantics-Preserving Evasion of LLM Vulnerability Detectors

Add code
Jan 30, 2026
Viaarxiv icon

T-FIX: Text-Based Explanations with Features Interpretable to eXperts

Add code
Nov 06, 2025
Viaarxiv icon

Once Upon an Input: Reasoning via Per-Instance Program Synthesis

Add code
Oct 26, 2025
Viaarxiv icon

Stable Prediction of Adverse Events in Medical Time-Series Data

Add code
Oct 16, 2025
Figure 1 for Stable Prediction of Adverse Events in Medical Time-Series Data
Figure 2 for Stable Prediction of Adverse Events in Medical Time-Series Data
Figure 3 for Stable Prediction of Adverse Events in Medical Time-Series Data
Figure 4 for Stable Prediction of Adverse Events in Medical Time-Series Data
Viaarxiv icon

Instruction Following by Boosting Attention of Large Language Models

Add code
Jun 16, 2025
Figure 1 for Instruction Following by Boosting Attention of Large Language Models
Figure 2 for Instruction Following by Boosting Attention of Large Language Models
Figure 3 for Instruction Following by Boosting Attention of Large Language Models
Figure 4 for Instruction Following by Boosting Attention of Large Language Models
Viaarxiv icon

Benchmarking Misuse Mitigation Against Covert Adversaries

Add code
Jun 06, 2025
Viaarxiv icon