Picture for Negar Arabzadeh

Negar Arabzadeh

RAG over Thinking Traces Can Improve Reasoning Tasks

Add code
May 05, 2026
Viaarxiv icon

PeeriScope: A Multi-Faceted Framework for Evaluating Peer Review Quality

Add code
Apr 27, 2026
Viaarxiv icon

Can QPP Choose the Right Query Variant? Evaluating Query Variant Selection for RAG Pipelines

Add code
Apr 24, 2026
Viaarxiv icon

Peerispect: Claim Verification in Scientific Peer Reviews

Add code
Apr 19, 2026
Viaarxiv icon

PeerPrism: Peer Evaluation Expertise vs Review-writing AI

Add code
Apr 16, 2026
Viaarxiv icon

ReFormeR: Learning and Applying Explicit Query Reformulation Patterns

Add code
Apr 01, 2026
Viaarxiv icon

From Noise to Order: Learning to Rank via Denoising Diffusion

Add code
Feb 12, 2026
Viaarxiv icon

DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis

Add code
Aug 27, 2025
Figure 1 for DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
Figure 2 for DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
Figure 3 for DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
Figure 4 for DeepScholar-Bench: A Live Benchmark and Automated Evaluation for Generative Research Synthesis
Viaarxiv icon

Benchmarking LLM-based Relevance Judgment Methods

Add code
Apr 17, 2025
Viaarxiv icon

A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment

Add code
Apr 16, 2025
Figure 1 for A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
Figure 2 for A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
Figure 3 for A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
Figure 4 for A Human-AI Comparative Analysis of Prompt Sensitivity in LLM-Based Relevance Judgment
Viaarxiv icon