Zhaoran Wang

A Mean-Field Analysis of Neural Gradient Descent-Ascent: Applications to Functional Conditional Moment Equations

Apr 18, 2024
Via arXiv

Advancing Object Goal Navigation Through LLM-enhanced Object Affinities Transfer

Mar 15, 2024
Via arXiv

Can Large Language Models Play Games? A Case Study of A Self-Play Approach

Mar 08, 2024
Via arXiv

How Can LLM Guide RL? A Value-Based Approach

Feb 25, 2024
Via arXiv

Double Duality: Variational Primal-Dual Policy Optimization for Constrained Reinforcement Learning

Feb 16, 2024
Via arXiv

Human-Instruction-Free LLM Self-Alignment with Limited Samples

Jan 06, 2024
Via arXiv

Sparse PCA with Oracle Property

Dec 28, 2023
Via arXiv

Empowering Autonomous Driving with Large Language Models: A Safety Perspective

Nov 28, 2023
Via arXiv

Provably Efficient High-Dimensional Bandit Learning with Batched Feedbacks

Nov 24, 2023
Via arXiv

A Principled Framework for Knowledge-enhanced Large Language Model

Nov 18, 2023
Via arXiv