David Brandfonbrener

CoLoR-Filter: Conditional Loss Reduction Filtering for Targeted Language Model Pre-training

Jun 15, 2024

Q-Probe: A Lightweight Approach to Reward Maximization for Language Models

Feb 22, 2024

Verified Multi-Step Synthesis using Large Language Models and Monte Carlo Tree Search

Feb 13, 2024

Repeat After Me: Transformers are Better than State Space Models at Copying

Feb 01, 2024

Inverse Dynamics Pretraining Learns Good Representations for Multitask Imitation

May 26, 2023

Visual Backtracking Teleoperation: A Data Collection Protocol for Offline Image-Based Reinforcement Learning

Oct 05, 2022

Incorporating Explicit Uncertainty Estimates into Deep Offline Reinforcement Learning

Jun 02, 2022

When does return-conditioned supervised learning work for offline reinforcement learning?

Jun 02, 2022

Don't Change the Algorithm, Change the Data: Exploratory Data for Offline Reinforcement Learning

Feb 08, 2022

Quantile Filtered Imitation Learning

Dec 02, 2021