Picture for Greg Durrett

Greg Durrett

EvalAgent: Discovering Implicit Evaluation Criteria from the Web

Add code
Apr 21, 2025
Viaarxiv icon

CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation

Add code
Apr 21, 2025
Viaarxiv icon

Pairwise or Pointwise? Evaluating Feedback Protocols for Bias in LLM-Based Evaluation

Add code
Apr 20, 2025
Viaarxiv icon

RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models

Add code
Apr 15, 2025
Viaarxiv icon

QUDsim: Quantifying Discourse Similarities in LLM-Generated Text

Add code
Apr 12, 2025
Viaarxiv icon

Is the Top Still Spinning? Evaluating Subjectivity in Narrative Understanding

Add code
Apr 01, 2025
Viaarxiv icon

${\rm P{\small ROOF}W{\small ALA}}$: Multilingual Proof Data Synthesis and Theorem-Proving

Add code
Feb 07, 2025
Viaarxiv icon

LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation

Add code
Jan 09, 2025
Figure 1 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 2 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 3 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Figure 4 for LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation
Viaarxiv icon

Understanding Synthetic Context Extension via Retrieval Heads

Add code
Oct 29, 2024
Figure 1 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 2 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 3 for Understanding Synthetic Context Extension via Retrieval Heads
Figure 4 for Understanding Synthetic Context Extension via Retrieval Heads
Viaarxiv icon

Contrastive Learning to Improve Retrieval for Real-world Fact Checking

Add code
Oct 07, 2024
Figure 1 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 2 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 3 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Figure 4 for Contrastive Learning to Improve Retrieval for Real-world Fact Checking
Viaarxiv icon