Picture for William Jurayj

William Jurayj

Many-Tier Instruction Hierarchy in LLM Agents

Add code
Apr 14, 2026
Viaarxiv icon

Unified Multimodal Uncertain Inference

Add code
Apr 13, 2026
Viaarxiv icon

Weird Generalization is Weirdly Brittle

Add code
Apr 11, 2026
Viaarxiv icon

DeonticBench: A Benchmark for Reasoning over Rules

Add code
Apr 06, 2026
Viaarxiv icon

Conformal Thinking: Risk Control for Reasoning on a Compute Budget

Add code
Feb 03, 2026
Viaarxiv icon

Enabling Equitable Access to Trustworthy Financial Reasoning

Add code
Aug 28, 2025
Viaarxiv icon

CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?

Add code
Mar 27, 2025
Figure 1 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 2 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 3 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Figure 4 for CLAIMCHECK: How Grounded are LLM Critiques of Scientific Papers?
Viaarxiv icon

Is That Your Final Answer? Test-Time Scaling Improves Selective Question Answering

Add code
Feb 19, 2025
Viaarxiv icon

Gaps or Hallucinations? Gazing into Machine-Generated Legal Analysis for Fine-grained Text Evaluations

Add code
Sep 16, 2024
Viaarxiv icon

Garden-Path Traversal within GPT-2

Add code
May 24, 2022
Figure 1 for Garden-Path Traversal within GPT-2
Figure 2 for Garden-Path Traversal within GPT-2
Figure 3 for Garden-Path Traversal within GPT-2
Figure 4 for Garden-Path Traversal within GPT-2
Viaarxiv icon