Picture for Xiangnan He

Xiangnan He

Contrastive Weak-to-strong Generalization

Add code
Oct 09, 2025
Viaarxiv icon

Quantile Advantage Estimation for Entropy-Safe Reasoning

Add code
Sep 26, 2025
Viaarxiv icon

AppAgent-Pro: A Proactive GUI Agent System for Multidomain Information Integration and User Assistance

Add code
Aug 27, 2025
Viaarxiv icon

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code

Add code
Jul 10, 2025
Viaarxiv icon

Boosting Parameter Efficiency in LLM-Based Recommendation through Sophisticated Pruning

Add code
Jul 09, 2025
Viaarxiv icon

Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs

Add code
Jun 16, 2025
Figure 1 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 2 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 3 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Figure 4 for Mitigating Safety Fallback in Editing-based Backdoor Injection on LLMs
Viaarxiv icon

Reinforced Latent Reasoning for LLM-based Recommendation

Add code
May 25, 2025
Viaarxiv icon

Addressing Missing Data Issue for Diffusion-based Recommendation

Add code
May 18, 2025
Viaarxiv icon

AdaViP: Aligning Multi-modal LLMs via Adaptive Vision-enhanced Preference Optimization

Add code
Apr 22, 2025
Viaarxiv icon

Route Sparse Autoencoder to Interpret Large Language Models

Add code
Mar 11, 2025
Viaarxiv icon