Picture for Ruochen Jiao

Ruochen Jiao

CreFlow: Corrective Reflow for Sparse-Reward Embodied Video Diffusion RL

Add code
May 14, 2026
Viaarxiv icon

Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning

Add code
Jul 23, 2025
Figure 1 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 2 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 3 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Figure 4 for Shop-R1: Rewarding LLMs to Simulate Human Behavior in Online Shopping via Reinforcement Learning
Viaarxiv icon

Inverse Delayed Reinforcement Learning

Add code
Dec 04, 2024
Figure 1 for Inverse Delayed Reinforcement Learning
Figure 2 for Inverse Delayed Reinforcement Learning
Figure 3 for Inverse Delayed Reinforcement Learning
Figure 4 for Inverse Delayed Reinforcement Learning
Viaarxiv icon

Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments

Add code
Oct 04, 2024
Figure 1 for Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
Figure 2 for Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
Figure 3 for Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
Figure 4 for Model-Based Reward Shaping for Adversarial Inverse Reinforcement Learning in Stochastic Environments
Viaarxiv icon

Exploring Backdoor Attacks against Large Language Model-based Decision Making

Add code
May 27, 2024
Figure 1 for Exploring Backdoor Attacks against Large Language Model-based Decision Making
Figure 2 for Exploring Backdoor Attacks against Large Language Model-based Decision Making
Figure 3 for Exploring Backdoor Attacks against Large Language Model-based Decision Making
Figure 4 for Exploring Backdoor Attacks against Large Language Model-based Decision Making
Viaarxiv icon

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Add code
Nov 28, 2023
Viaarxiv icon

State-wise Safe Reinforcement Learning With Pixel Observations

Add code
Nov 03, 2023
Figure 1 for State-wise Safe Reinforcement Learning With Pixel Observations
Figure 2 for State-wise Safe Reinforcement Learning With Pixel Observations
Figure 3 for State-wise Safe Reinforcement Learning With Pixel Observations
Figure 4 for State-wise Safe Reinforcement Learning With Pixel Observations
Viaarxiv icon

Kinematics-aware Trajectory Generation and Prediction with Latent Stochastic Differential Modeling

Add code
Sep 17, 2023
Viaarxiv icon

Safety-Assured Speculative Planning with Adaptive Prediction

Add code
Jul 21, 2023
Figure 1 for Safety-Assured Speculative Planning with Adaptive Prediction
Figure 2 for Safety-Assured Speculative Planning with Adaptive Prediction
Figure 3 for Safety-Assured Speculative Planning with Adaptive Prediction
Figure 4 for Safety-Assured Speculative Planning with Adaptive Prediction
Viaarxiv icon

Learning Representation for Anomaly Detection of Vehicle Trajectories

Add code
Mar 09, 2023
Viaarxiv icon