Picture for Dengjia Zhang

Dengjia Zhang

Multilingual Reasoning Cascades Need More Context

Add code
Jun 25, 2026
Viaarxiv icon

Findings of the MAGMaR 2026 Shared Task

Add code
Jun 10, 2026
Viaarxiv icon

Diagnosing Multi-step Reasoning Failures in Black-box LLMs via Stepwise Confidence Attribution

Add code
May 19, 2026
Viaarxiv icon

Unified Multimodal Uncertain Inference

Add code
Apr 13, 2026
Viaarxiv icon

SELAUR: Self Evolving LLM Agent via Uncertainty-aware Rewards

Add code
Feb 25, 2026
Viaarxiv icon

HLTCOE Evaluation Team at TREC 2025: VQA Track

Add code
Dec 08, 2025
Viaarxiv icon