Kan Ren

Interpreting and Controlling LLM Reasoning through Integrated Policy Gradient

Feb 03, 2026

Grad2Reward: From Sparse Judgment to Dense Rewards for Improving Open-Ended LLM Reasoning

Feb 02, 2026

DiffusionUavLoc: Visually Prompted Diffusion for Cross-View UAV Localization

Nov 09, 2025

Linking Process to Outcome: Conditional Reward Modeling for LLM Reasoning

Sep 30, 2025

Learning to Select In-Context Demonstration Preferred by Large Language Model

May 26, 2025

Chain-of-Model Learning for Language Model

May 17, 2025

Dissecting and Mitigating Diffusion Bias via Mechanistic Interpretability

Mar 26, 2025

Discovering Influential Neuron Path in Vision Transformers

Mar 12, 2025

VisEval: A Benchmark for Data Visualization in the Era of Large Language Models

Jul 01, 2024

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

May 25, 2024