Haifeng Zhang

Learning to Discuss Strategically: A Case Study on One Night Ultimate Werewolf

May 30, 2024

Token-level Direct Preference Optimization

Apr 18, 2024

Learning Macroeconomic Policies based on Microfoundations: A Stackelberg Mean Field Game Approach

Mar 14, 2024

AI-Based Energy Transportation Safety: Pipeline Radial Threat Estimation Using Intelligent Sensing System

Dec 26, 2023

Large Language Models Play StarCraft II: Benchmarks and A Chain of Summarization Approach

Dec 19, 2023

Ask more, know better: Reinforce-Learned Prompt Questions for Decision Making with Large Language Models

Oct 27, 2023

Analysis on Multi-robot Relative 6-DOF Pose Estimation Error Based on UWB Range

Sep 27, 2023

Large Sequence Models for Sequential Decision-Making: A Survey

Jun 24, 2023

An Empirical Study on Google Research Football Multi-agent Scenarios

May 16, 2023

Contextual Transformer for Offline Meta Reinforcement Learning

Nov 15, 2022