Alert button
Picture for Huaimin Wang

Huaimin Wang

Alert button

Optimistic Model Rollouts for Pessimistic Offline Policy Optimization

Add code
Bookmark button
Alert button
Jan 11, 2024
Yuanzhao Zhai, Yiying Li, Zijian Gao, Xudong Gong, Kele Xu, Dawei Feng, Ding Bo, Huaimin Wang

Viaarxiv icon

Uncertainty-Penalized Reinforcement Learning from Human Feedback with Diverse Reward LoRA Ensembles

Add code
Bookmark button
Alert button
Dec 30, 2023
Yuanzhao Zhai, Han Zhang, Yu Lei, Yue Yu, Kele Xu, Dawei Feng, Bo Ding, Huaimin Wang

Viaarxiv icon

Intelligent Computing: The Latest Advances, Challenges and Future

Add code
Bookmark button
Alert button
Nov 21, 2022
Shiqiang Zhu, Ting Yu, Tao Xu, Hongyang Chen, Schahram Dustdar, Sylvain Gigan, Deniz Gunduz, Ekram Hossain, Yaochu Jin, Feng Lin, Bo Liu, Zhiguo Wan, Ji Zhang, Zhifeng Zhao, Wentao Zhu, Zuoning Chen, Tariq Durrani, Huaimin Wang, Jiangxing Wu, Tongyi Zhang, Yunhe Pan

Figure 1 for Intelligent Computing: The Latest Advances, Challenges and Future
Figure 2 for Intelligent Computing: The Latest Advances, Challenges and Future
Figure 3 for Intelligent Computing: The Latest Advances, Challenges and Future
Figure 4 for Intelligent Computing: The Latest Advances, Challenges and Future
Viaarxiv icon

Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning

Add code
Bookmark button
Alert button
Aug 24, 2022
Zijian Gao, Kele Xu, HengXing Cai, Yuanzhao Zhai, Dawei Feng, Bo Ding, XinJun Mao, Huaimin Wang

Figure 1 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 2 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 3 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Figure 4 for Self-Supervised Exploration via Temporal Inconsistency in Reinforcement Learning
Viaarxiv icon

Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration

Add code
Bookmark button
Alert button
Aug 24, 2022
Zijian Gao, Kele Xu, YiYing Li, Yuanzhao Zhai, Dawei Feng, Bo Ding, XinJun Mao, Huaimin Wang

Figure 1 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 2 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 3 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Figure 4 for Dynamic Memory-based Curiosity: A Bootstrap Approach for Exploration
Viaarxiv icon

Trusted Multi-Scale Classification Framework for Whole Slide Image

Add code
Bookmark button
Alert button
Jul 12, 2022
Ming Feng, Kele Xu, Nanhui Wu, Weiquan Huang, Yan Bai, Changjian Wang, Huaimin Wang

Figure 1 for Trusted Multi-Scale Classification Framework for Whole Slide Image
Figure 2 for Trusted Multi-Scale Classification Framework for Whole Slide Image
Figure 3 for Trusted Multi-Scale Classification Framework for Whole Slide Image
Figure 4 for Trusted Multi-Scale Classification Framework for Whole Slide Image
Viaarxiv icon

Nuclear Norm Maximization Based Curiosity-Driven Learning

Add code
Bookmark button
Alert button
May 28, 2022
Chao Chen, Zijian Gao, Kele Xu, Sen Yang, Yiying Li, Bo Ding, Dawei Feng, Huaimin Wang

Figure 1 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 2 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 3 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Figure 4 for Nuclear Norm Maximization Based Curiosity-Driven Learning
Viaarxiv icon

Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast

Add code
Bookmark button
Alert button
May 02, 2022
Boqing Zhu, Kele Xu, Changjian Wang, Zheng Qin, Tao Sun, Huaimin Wang, Yuxing Peng

Figure 1 for Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Figure 2 for Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Figure 3 for Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Figure 4 for Unsupervised Voice-Face Representation Learning by Cross-Modal Prototype Contrast
Viaarxiv icon