Pengyi Li

Confidence Is All You Need: Few-Shot RL Fine-Tuning of Language Models

Jun 05, 2025

From Seeing to Doing: Bridging Reasoning and Decision for Robotic Manipulation

May 13, 2025

MaxInfo: A Training-Free Key-Frame Selection Method Using Maximum Volume for Enhanced Video Understanding

Feb 05, 2025

CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making

Jun 13, 2024

DiffuserLite: Towards Real-time Diffusion Planning

Feb 02, 2024

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey

Jan 22, 2024

ERL-Re$^2$: Efficient Evolutionary Reinforcement Learning with Shared State Representation and Individual Policy Representation

Oct 26, 2022

PMIC: Improving Multi-Agent Reinforcement Learning with Progressive Mutual Information Collaboration

Mar 16, 2022

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

Sep 12, 2021