Picture for Caglar Gulcehre

Caglar Gulcehre

Aligning Large Language Models with Diverse Political Viewpoints

Add code
Jun 20, 2024
Figure 1 for Aligning Large Language Models with Diverse Political Viewpoints
Figure 2 for Aligning Large Language Models with Diverse Political Viewpoints
Figure 3 for Aligning Large Language Models with Diverse Political Viewpoints
Figure 4 for Aligning Large Language Models with Diverse Political Viewpoints
Viaarxiv icon

Promises, Outlooks and Challenges of Diffusion Language Modeling

Add code
Jun 17, 2024
Viaarxiv icon

PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer

Add code
Jun 10, 2024
Figure 1 for PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Figure 2 for PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Figure 3 for PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Figure 4 for PlanDQ: Hierarchical Plan Orchestration via D-Conductor and Q-Performer
Viaarxiv icon

Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering

Add code
May 07, 2024
Figure 1 for Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering
Figure 2 for Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering
Figure 3 for Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering
Figure 4 for Fleet of Agents: Coordinated Problem Solving with Large Language Models using Genetic Particle Filtering
Viaarxiv icon

No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO

Add code
May 01, 2024
Figure 1 for No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Figure 2 for No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Figure 3 for No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Figure 4 for No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO
Viaarxiv icon

Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models

Add code
Feb 29, 2024
Viaarxiv icon

Simple Hierarchical Planning with Diffusion

Add code
Jan 05, 2024
Figure 1 for Simple Hierarchical Planning with Diffusion
Figure 2 for Simple Hierarchical Planning with Diffusion
Figure 3 for Simple Hierarchical Planning with Diffusion
Figure 4 for Simple Hierarchical Planning with Diffusion
Viaarxiv icon

Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models

Add code
Nov 15, 2023
Figure 1 for Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Figure 2 for Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Figure 3 for Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Figure 4 for Imagine the Unseen World: A Benchmark for Systematic Generalization in Visual World Models
Viaarxiv icon

Reinforced Self-Training (ReST) for Language Modeling

Add code
Aug 21, 2023
Viaarxiv icon

AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning

Add code
Aug 07, 2023
Figure 1 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 2 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 3 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Figure 4 for AlphaStar Unplugged: Large-Scale Offline Reinforcement Learning
Viaarxiv icon