Picture for Zihan Dong

Zihan Dong

Evaluating LLMs When They Do Not Know the Answer: Statistical Evaluation of Mathematical Reasoning via Comparative Signals

Add code
Feb 03, 2026
Viaarxiv icon

Alternating Reinforcement Learning for Rubric-Based Reward Modeling in Non-Verifiable LLM Post-Training

Add code
Feb 02, 2026
Viaarxiv icon

Labels or Preferences? Budget-Constrained Learning with Human Judgments over AI-Generated Outputs

Add code
Jan 19, 2026
Viaarxiv icon

Students' Perceptions and Preferences of Generative Artificial Intelligence Feedback for Programming

Add code
Dec 17, 2023
Viaarxiv icon

Enhancing Bloodstain Analysis Through AI-Based Segmentation: Leveraging Segment Anything Model for Crime Scene Investigation

Add code
Aug 27, 2023
Viaarxiv icon