Picture for Ying Wen

Ying Wen

Aligning Individual and Collective Objectives in Multi-Agent Cooperation

Add code
Feb 19, 2024
Figure 1 for Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Figure 2 for Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Figure 3 for Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Figure 4 for Aligning Individual and Collective Objectives in Multi-Agent Cooperation
Viaarxiv icon

Natural Language Reinforcement Learning

Add code
Feb 14, 2024
Viaarxiv icon

Entropy-Regularized Token-Level Policy Optimization for Large Language Models

Add code
Feb 09, 2024
Figure 1 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 2 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 3 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Figure 4 for Entropy-Regularized Token-Level Policy Optimization for Large Language Models
Viaarxiv icon

Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios

Add code
Dec 29, 2023
Figure 1 for Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios
Figure 2 for Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios
Figure 3 for Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios
Figure 4 for Adaptive Control Strategy for Quadruped Robots in Actuator Degradation Scenarios
Viaarxiv icon

Critic-Guided Decision Transformer for Offline Reinforcement Learning

Add code
Dec 21, 2023
Figure 1 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 2 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 3 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Figure 4 for Critic-Guided Decision Transformer for Offline Reinforcement Learning
Viaarxiv icon

Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach

Add code
Nov 23, 2023
Figure 1 for Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Figure 2 for Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Figure 3 for Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Figure 4 for Controlling Large Language Model-based Agents for Large-Scale Decision-Making: An Actor-Critic Approach
Viaarxiv icon

Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners

Add code
Oct 08, 2023
Figure 1 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 2 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 3 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Figure 4 for Quantifying Zero-shot Coordination Capability with Behavior Preferring Partners
Viaarxiv icon

GEAR: A GPU-Centric Experience Replay System for Large Reinforcement Learning Models

Add code
Oct 08, 2023
Viaarxiv icon

Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training

Add code
Sep 29, 2023
Figure 1 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 2 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 3 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Figure 4 for Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training
Viaarxiv icon

Cross-Utterance Conditioned VAE for Speech Generation

Add code
Sep 08, 2023
Figure 1 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 2 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 3 for Cross-Utterance Conditioned VAE for Speech Generation
Figure 4 for Cross-Utterance Conditioned VAE for Speech Generation
Viaarxiv icon