Arman Cohan

REVERE: Reflective Evolving Research Engineer for Scientific Workflows

Mar 21, 2026
Via arXiv

Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training

Mar 12, 2026
Via arXiv

SciMDR: Benchmarking and Advancing Scientific Multimodal Document Reasoning

Mar 12, 2026
Via arXiv

RbtAct: Rebuttal as Supervision for Actionable Review Feedback Generation

Mar 10, 2026
Via arXiv

Deconstructing Multimodal Mathematical Reasoning: Towards a Unified Perception-Alignment-Reasoning Paradigm

Mar 09, 2026
Via arXiv

QEDBENCH: Quantifying the Alignment Gap in Automated Evaluation of University-Level Mathematical Proofs

Feb 24, 2026
Via arXiv

References Improve LLM Alignment in Non-Verifiable Domains

Feb 18, 2026
Via arXiv

ResearchGym: Evaluating Language Model Agents on Real-World AI Research

Feb 16, 2026
Via arXiv

ANCHOR: Branch-Point Data Generation for GUI Agents

Feb 06, 2026
Via arXiv

SAGE: Benchmarking and Improving Retrieval for Deep Research Agents

Feb 05, 2026
Via arXiv