Ying Wen

Fusion-PSRO: Nash Policy Fusion for Policy Space Response Oracles

Jun 03, 2024
Via arXiv

Efficient Model-agnostic Alignment via Bayesian Persuasion

May 29, 2024
Via arXiv

Reinforcing Language Agents via Policy Optimization with Action Decomposition

May 23, 2024
Via arXiv

DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning

Mar 13, 2024
Via arXiv

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision

Mar 10, 2024
Via arXiv

AceMap: Knowledge Discovery through Academic Graph

Mar 05, 2024
Via arXiv

Offline Fictitious Self-Play for Competitive Games

Feb 29, 2024
Via arXiv

Aligning Individual and Collective Objectives in Multi-Agent Cooperation

Feb 19, 2024
Via arXiv

Natural Language Reinforcement Learning

Feb 14, 2024
Via arXiv

Entropy-Regularized Token-Level Policy Optimization for Large Language Models

Feb 09, 2024
Via arXiv