Alert button
Picture for Hongyi Guo

Hongyi Guo

Alert button

Diverse Randomized Value Functions: A Provably Pessimistic Approach for Offline Reinforcement Learning

Add code
Bookmark button
Alert button
Apr 09, 2024
Xudong Yu, Chenjia Bai, Hongyi Guo, Changhong Wang, Zhen Wang

Viaarxiv icon

Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards

Add code
Bookmark button
Alert button
Mar 14, 2024
Wei Shen, Xiaoying Zhang, Yuanshun Yao, Rui Zheng, Hongyi Guo, Yang Liu

Figure 1 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 2 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 3 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Figure 4 for Improving Reinforcement Learning from Human Feedback Using Contrastive Rewards
Viaarxiv icon

Can Large Language Models Play Games? A Case Study of A Self-Play Approach

Add code
Bookmark button
Alert button
Mar 08, 2024
Hongyi Guo, Zhihan Liu, Yufeng Zhang, Zhaoran Wang

Figure 1 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 2 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 3 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Figure 4 for Can Large Language Models Play Games? A Case Study of A Self-Play Approach
Viaarxiv icon

Measuring and Reducing LLM Hallucination without Gold-Standard Answers via Expertise-Weighting

Add code
Bookmark button
Alert button
Feb 16, 2024
Jiaheng Wei, Yuanshun Yao, Jean-Francois Ton, Hongyi Guo, Andrew Estornell, Yang Liu

Viaarxiv icon

Human-Instruction-Free LLM Self-Alignment with Limited Samples

Add code
Bookmark button
Alert button
Jan 06, 2024
Hongyi Guo, Yuanshun Yao, Wei Shen, Jiaheng Wei, Xiaoying Zhang, Zhaoran Wang, Yang Liu

Viaarxiv icon

Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency

Add code
Bookmark button
Alert button
Oct 11, 2023
Zhihan Liu, Hao Hu, Shenao Zhang, Hongyi Guo, Shuqi Ke, Boyi Liu, Zhaoran Wang

Figure 1 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 2 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 3 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Figure 4 for Reason for Future, Act for Now: A Principled Framework for Autonomous LLM Agents with Provable Sample Efficiency
Viaarxiv icon

Behavior Contrastive Learning for Unsupervised Skill Discovery

Add code
Bookmark button
Alert button
May 08, 2023
Rushuai Yang, Chenjia Bai, Hongyi Guo, Siyuan Li, Bin Zhao, Zhen Wang, Peng Liu, Xuelong Li

Figure 1 for Behavior Contrastive Learning for Unsupervised Skill Discovery
Figure 2 for Behavior Contrastive Learning for Unsupervised Skill Discovery
Figure 3 for Behavior Contrastive Learning for Unsupervised Skill Discovery
Figure 4 for Behavior Contrastive Learning for Unsupervised Skill Discovery
Viaarxiv icon

Policy Learning Using Weak Supervision

Add code
Bookmark button
Alert button
Oct 05, 2020
Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu

Figure 1 for Policy Learning Using Weak Supervision
Figure 2 for Policy Learning Using Weak Supervision
Figure 3 for Policy Learning Using Weak Supervision
Figure 4 for Policy Learning Using Weak Supervision
Viaarxiv icon

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates

Add code
Bookmark button
Alert button
Oct 08, 2019
Yang Liu, Hongyi Guo

Figure 1 for Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates
Figure 2 for Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates
Figure 3 for Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates
Figure 4 for Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates
Viaarxiv icon