Picture for Fuzhen Zhuang

Fuzhen Zhuang

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger

Add code
Feb 09, 2026
Viaarxiv icon

Does Your Reasoning Model Implicitly Know When to Stop Thinking?

Add code
Feb 09, 2026
Viaarxiv icon

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards

Add code
Feb 09, 2026
Viaarxiv icon

Real-Time Aligned Reward Model beyond Semantics

Add code
Jan 30, 2026
Viaarxiv icon

Enhancing LLM-based Recommendation with Preference Hint Discovery from Knowledge Graph

Add code
Jan 26, 2026
Viaarxiv icon

Your Group-Relative Advantage Is Biased

Add code
Jan 13, 2026
Viaarxiv icon

LLMBoost: Make Large Language Models Stronger with Boosting

Add code
Dec 26, 2025
Viaarxiv icon

How Do Graph Signals Affect Recommendation: Unveiling the Mystery of Low and High-Frequency Graph Signals

Add code
Dec 10, 2025
Figure 1 for How Do Graph Signals Affect Recommendation: Unveiling the Mystery of Low and High-Frequency Graph Signals
Figure 2 for How Do Graph Signals Affect Recommendation: Unveiling the Mystery of Low and High-Frequency Graph Signals
Figure 3 for How Do Graph Signals Affect Recommendation: Unveiling the Mystery of Low and High-Frequency Graph Signals
Figure 4 for How Do Graph Signals Affect Recommendation: Unveiling the Mystery of Low and High-Frequency Graph Signals
Viaarxiv icon

Multi-Aspect Cross-modal Quantization for Generative Recommendation

Add code
Nov 19, 2025
Viaarxiv icon

FLeW: Facet-Level and Adaptive Weighted Representation Learning of Scientific Documents

Add code
Sep 09, 2025
Viaarxiv icon