Ling Pan

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

Dec 31, 2025, via arXiv

GARDO: Reinforcing Diffusion Models without Reward Hacking

Dec 30, 2025, via arXiv

Asymmetric Proximal Policy Optimization: mini-critics boost LLM reasoning

Oct 02, 2025, via arXiv

Generative Flow Networks for Personalized Multimedia Systems: A Case Study on Short Video Feeds

Aug 23, 2025, via arXiv

Part I: Tricks or Traps? A Deep Dive into RL for LLM Reasoning

Aug 11, 2025, via arXiv

The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning

Jun 16, 2025, via arXiv

Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning

May 29, 2025, via arXiv

Navigate the Unknown: Enhancing LLM Reasoning with Intrinsic Motivation Guided Exploration

May 23, 2025, via arXiv

Scaling Image and Video Generation via Test-Time Evolutionary Search

May 23, 2025, via arXiv

Beyond the Destination: A Novel Benchmark for Exploration-Aware Embodied Question Answering

Mar 14, 2025, via arXiv