Picture for Wangjie Gan

Wangjie Gan

GFT: From Imitation to Reward Fine-Tuning with Unbiased Group Advantages and Dynamic Coefficient Rectification

Add code
Apr 15, 2026
Viaarxiv icon

Ground What You See: Hallucination-Resistant MLLMs via Caption Feedback, Diversity-Aware Sampling, and Conflict Regularization

Add code
Jan 13, 2026
Viaarxiv icon