Picture for Zeqian Huang

Zeqian Huang

Negative Advantage Is a Double-Edged Sword: Calibrating Advantage in GRPO for Deep Search

Add code
Apr 20, 2026
Viaarxiv icon

Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking

Add code
Feb 27, 2023
Viaarxiv icon

Multi-Feature Integration for Perception-Dependent Examination-Bias Estimation

Add code
Feb 27, 2023
Viaarxiv icon