Picture for Jiancan Wu

Jiancan Wu

SepSeq: A Training-Free Framework for Long Numerical Sequence Processing in LLMs

Add code
Apr 09, 2026
Viaarxiv icon

Beyond Where to Look: Trajectory-Guided Reinforcement Learning for Multimodal RLVR

Add code
Mar 27, 2026
Viaarxiv icon

Bridging Perception and Reasoning: Token Reweighting for RLVR in Multimodal LLMs

Add code
Mar 26, 2026
Viaarxiv icon

On the Direction of RLVR Updates for LLM Reasoning: Identification and Exploitation

Add code
Mar 23, 2026
Viaarxiv icon

Fine-grained Semantics Integration for Large Language Model-based Recommendation

Add code
Feb 26, 2026
Viaarxiv icon

Think before Recommendation: Autonomous Reasoning-enhanced Recommender

Add code
Oct 27, 2025
Viaarxiv icon

Quantile Advantage Estimation for Entropy-Safe Reasoning

Add code
Sep 26, 2025
Viaarxiv icon

Addressing Missing Data Issue for Diffusion-based Recommendation

Add code
May 18, 2025
Viaarxiv icon

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Add code
Apr 22, 2025
Viaarxiv icon

RePO: ReLU-based Preference Optimization

Add code
Mar 10, 2025
Viaarxiv icon