Picture for Jianye Hao

Jianye Hao

Reinforced In-Context Black-Box Optimization

Add code
Feb 27, 2024
Figure 1 for Reinforced In-Context Black-Box Optimization
Figure 2 for Reinforced In-Context Black-Box Optimization
Figure 3 for Reinforced In-Context Black-Box Optimization
Figure 4 for Reinforced In-Context Black-Box Optimization
Viaarxiv icon

Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models

Add code
Feb 22, 2024
Figure 1 for Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models
Figure 2 for Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models
Figure 3 for Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models
Figure 4 for Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models
Viaarxiv icon

MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint

Add code
Feb 22, 2024
Figure 1 for MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Figure 2 for MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Figure 3 for MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Figure 4 for MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint
Viaarxiv icon

Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback

Add code
Feb 04, 2024
Viaarxiv icon

DiffuserLite: Towards Real-time Diffusion Planning

Add code
Feb 02, 2024
Figure 1 for DiffuserLite: Towards Real-time Diffusion Planning
Figure 2 for DiffuserLite: Towards Real-time Diffusion Planning
Figure 3 for DiffuserLite: Towards Real-time Diffusion Planning
Figure 4 for DiffuserLite: Towards Real-time Diffusion Planning
Viaarxiv icon

Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey

Add code
Jan 22, 2024
Viaarxiv icon

Machine Learning Insides OptVerse AI Solver: Design Principles and Applications

Add code
Jan 17, 2024
Figure 1 for Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Figure 2 for Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Figure 3 for Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Figure 4 for Machine Learning Insides OptVerse AI Solver: Design Principles and Applications
Viaarxiv icon

Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning

Add code
Dec 22, 2023
Figure 1 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 2 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 3 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Figure 4 for Pangu-Agent: A Fine-Tunable Generalist Agent with Structured Reasoning
Viaarxiv icon

OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments

Add code
Dec 20, 2023
Figure 1 for OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Figure 2 for OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Figure 3 for OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Figure 4 for OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments
Viaarxiv icon

Rethinking Decision Transformer via Hierarchical Reinforcement Learning

Add code
Nov 01, 2023
Figure 1 for Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Figure 2 for Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Figure 3 for Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Figure 4 for Rethinking Decision Transformer via Hierarchical Reinforcement Learning
Viaarxiv icon