Picture for Xiangnan He

Xiangnan He

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Add code
Jul 10, 2025
Viaarxiv icon

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

Add code
Jul 09, 2025
Viaarxiv icon

Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs

Add code
Jun 16, 2025
Viaarxiv icon

Reinforced Latent Reasoning for LLM-based Recommendation

Add code
May 25, 2025
Viaarxiv icon

Addressing Missing Data Issue for Diffusion-based Recommendation

Add code
May 18, 2025
Viaarxiv icon

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Add code
Apr 22, 2025
Viaarxiv icon

Route Sparse Autoencoder to Interpret Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon

Process-Supervised LLM Recommenders via Flow-guided Tuning

Add code
Mar 10, 2025
Viaarxiv icon

RePO: ReLU-based Preference Optimization

Add code
Mar 10, 2025
Viaarxiv icon

Addressing Overprescribing Challenges: Fine-Tuning Large Language Models for Medication Recommendation Tasks

Add code
Mar 05, 2025
Viaarxiv icon