Picture for Jiakang Li

Jiakang Li

Latent Reward Steering: An Adaptive Inference-Time Framework that Implicitly Promotes Cognitive Behaviors in Reasoning LLMs

Add code
May 30, 2026
Viaarxiv icon

Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight

Add code
May 29, 2026
Viaarxiv icon

Teach Me How to Denoise: A Universal Framework for Denoising Multi-modal Recommender Systems via Guided Calibration

Add code
Apr 19, 2025
Viaarxiv icon

A Comprehensive Review of Community Detection in Graphs

Add code
Sep 26, 2023
Viaarxiv icon

Community Detection Using Revised Medoid-Shift Based on KNN

Add code
Apr 19, 2023
Viaarxiv icon