Picture for Patrick D. Watson

Patrick D. Watson

Agent-Diff: Benchmarking LLM Agents on Enterprise API Tasks via Code Execution with State-Diff-Based Evaluation

Add code
Feb 11, 2026
Viaarxiv icon

Frame Shift Prediction

Add code
Jan 05, 2022
Figure 1 for Frame Shift Prediction
Figure 2 for Frame Shift Prediction
Figure 3 for Frame Shift Prediction
Figure 4 for Frame Shift Prediction
Viaarxiv icon