Alert button
Picture for Zheng Wu

Zheng Wu

Alert button

Efficient Reinforcement Learning of Task Planners for Robotic Palletization through Iterative Action Masking Learning

Add code
Bookmark button
Alert button
Apr 07, 2024
Zheng Wu, Yichuan Li, Wei Zhan, Changliu Liu, Yun-Hui Liu, Masayoshi Tomizuka

Viaarxiv icon

DBPF: A Framework for Efficient and Robust Dynamic Bin-Picking

Add code
Bookmark button
Alert button
Mar 25, 2024
Yichuan Li, Junkai Zhao, Yixiao Li, Zheng Wu, Rui Cao, Masayoshi Tomizuka, Yunhui Liu

Viaarxiv icon

Pearl: A Production-ready Reinforcement Learning Agent

Add code
Bookmark button
Alert button
Dec 06, 2023
Zheqing Zhu, Rodrigo de Salvo Braz, Jalaj Bhandari, Daniel Jiang, Yi Wan, Yonathan Efroni, Liyuan Wang, Ruiyang Xu, Hongbo Guo, Alex Nikulkov, Dmytro Korenkevych, Urun Dogan, Frank Cheng, Zheng Wu, Wanqiao Xu

Viaarxiv icon

Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning

Add code
Bookmark button
Alert button
Oct 16, 2023
Xiang Zhang, Changhao Wang, Lingfeng Sun, Zheng Wu, Xinghao Zhu, Masayoshi Tomizuka

Figure 1 for Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning
Figure 2 for Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning
Figure 3 for Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning
Figure 4 for Efficient Sim-to-real Transfer of Contact-Rich Manipulation Skills with Online Admittance Residual Learning
Viaarxiv icon

Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward

Add code
Bookmark button
Alert button
Dec 03, 2022
Yanjiang Guo, Jingyue Gao, Zheng Wu, Chengming Shi, Jianyu Chen

Figure 1 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 2 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 3 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Figure 4 for Reinforcement learning with Demonstrations from Mismatched Task under Sparse Reward
Viaarxiv icon

Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks

Add code
Bookmark button
Alert button
Dec 02, 2022
Zheng Wu, Wenzhao Lian, Changhao Wang, Mengxi Li, Stefan Schaal, Masayoshi Tomizuka

Figure 1 for Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
Figure 2 for Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
Figure 3 for Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
Figure 4 for Prim-LAfD: A Framework to Learn and Adapt Primitive-Based Skills from Demonstrations for Insertion Tasks
Viaarxiv icon

Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

Add code
Bookmark button
Alert button
Oct 01, 2022
Zheng Wu, Yichen Xie, Wenzhao Lian, Changhao Wang, Yanjiang Guo, Jianyu Chen, Stefan Schaal, Masayoshi Tomizuka

Figure 1 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 2 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 3 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Figure 4 for Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning
Viaarxiv icon

ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale

Add code
Bookmark button
Alert button
Jul 22, 2022
Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure

Figure 1 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 2 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 3 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Figure 4 for ILASR: Privacy-Preserving Incremental Learning for Automatic Speech Recognition at Production Scale
Viaarxiv icon

ILASR: Privacy-Preserving Incremental Learning for AutomaticSpeech Recognition at Production Scale

Add code
Bookmark button
Alert button
Jul 19, 2022
Gopinath Chennupati, Milind Rao, Gurpreet Chadha, Aaron Eakin, Anirudh Raju, Gautam Tiwari, Anit Kumar Sahu, Ariya Rastrow, Jasha Droppo, Andy Oberlin, Buddha Nandanoor, Prahalad Venkataramanan, Zheng Wu, Pankaj Sitpure

Figure 1 for ILASR: Privacy-Preserving Incremental Learning for AutomaticSpeech Recognition at Production Scale
Figure 2 for ILASR: Privacy-Preserving Incremental Learning for AutomaticSpeech Recognition at Production Scale
Figure 3 for ILASR: Privacy-Preserving Incremental Learning for AutomaticSpeech Recognition at Production Scale
Figure 4 for ILASR: Privacy-Preserving Incremental Learning for AutomaticSpeech Recognition at Production Scale
Viaarxiv icon

Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks

Add code
Bookmark button
Alert button
Mar 28, 2022
Changhao Wang, Yuyou Zhang, Xiang Zhang, Zheng Wu, Xinghao Zhu, Shiyu Jin, Te Tang, Masayoshi Tomizuka

Figure 1 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 2 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 3 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Figure 4 for Offline-Online Learning of Deformation Model for Cable Manipulation with Graph Neural Networks
Viaarxiv icon