Zeming Liu

Utilizing and Calibrating Hindsight Process Rewards via Reinforcement with Mutual Information Self-Evaluation

Apr 13, 2026
Via arXiv

Policy Split: Incentivizing Dual-Mode Exploration in LLM Reinforcement with Dual-Mode Entropy Regularization

Apr 13, 2026
Via arXiv

Mem$^2$Evolve: Towards Self-Evolving Agents via Co-Evolutionary Capability Expansion and Experience Distillation

Apr 13, 2026
Via arXiv

Omni-WorldBench: Towards a Comprehensive Interaction-Centric Evaluation for World Models

Mar 23, 2026
Via arXiv

Beyond Unimodal Shortcuts: MLLMs as Cross-Modal Reasoners for Grounded Named Entity Recognition

Feb 04, 2026
Via arXiv

Beyond Literal Mapping: Benchmarking and Improving Non-Literal Translation Evaluation

Jan 12, 2026
Via arXiv

Character-R1: Enhancing Role-Aware Reasoning in Role-Playing Agents via RLVR

Jan 08, 2026
Via arXiv

DGA-Net: Enhancing SAM with Depth Prompting and Graph-Anchor Guidance for Camouflaged Object Detection

Jan 06, 2026
Via arXiv

TCM-Eval: An Expert-Level Dynamic and Extensible Benchmark for Traditional Chinese Medicine

Nov 10, 2025
Via arXiv

Learn More, Forget Less: A Gradient-Aware Data Selection Approach for LLM

Nov 07, 2025
Via arXiv