Picture for Zheng Wu

Zheng Wu

Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization

Add code
Oct 11, 2024
Viaarxiv icon

Efficient Reinforcement Learning of Task Planners for Robotic Palletization through Iterative Action Masking Learning

Add code
Apr 07, 2024
Viaarxiv icon

DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

Add code
Mar 25, 2024
Viaarxiv icon

Pearl: A Production-ready Reinforcement Learning Agent

Add code
Dec 06, 2023
Viaarxiv icon

Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning

Add code
Oct 16, 2023
Viaarxiv icon

Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward

Add code
Dec 03, 2022
Figure 1 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 2 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 3 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 4 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Viaarxiv icon

Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks

Add code
Dec 02, 2022
Viaarxiv icon

Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

Add code
Oct 01, 2022
Figure 1 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 2 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 3 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 4 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Viaarxiv icon

ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

Add code
Jul 22, 2022
Figure 1 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 2 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 3 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 4 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Viaarxiv icon

Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks

Add code
Mar 28, 2022
Figure 1 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 2 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 3 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 4 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Viaarxiv icon