Kehai Chen

Long-form RewardBench: Evaluating Reward Models for Long-form Generation

Mar 13, 2026
Via arXiv

Mitigating Translationese Bias in Multilingual LLM-as-a-Judge via Disentangled Information Bottleneck

Mar 11, 2026
Via arXiv

Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization

Mar 09, 2026
Via arXiv

Beyond Token-Level Policy Gradients for Complex Reasoning with Large Language Models

Feb 16, 2026
Via arXiv

Dynamics Within Latent Chain-of-Thought: An Empirical Study of Causal Structure

Feb 09, 2026
Via arXiv

Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition

Feb 04, 2026
Via arXiv

Decoupling Skeleton and Flesh: Efficient Multimodal Table Reasoning with Disentangled Alignment and Structure-aware Guidance

Feb 03, 2026
Via arXiv

VC-Bench: Pioneering the Video Connecting Benchmark with a Dataset and Evaluation Metrics

Jan 27, 2026
Via arXiv

Beyond Rigid: Benchmarking Non-Rigid Video Editing

Jan 26, 2026
Via arXiv

Character-R1: Enhancing Role-Aware Reasoning in Role-Playing Agents via RLVR

Jan 08, 2026
Via arXiv