Picture for John D. Co-Reyes

John D. Co-Reyes

Many-Shot In-Context Learning

Add code
Apr 17, 2024
Viaarxiv icon

Guided Evolution with Binary Discriminators for ML Program Search

Add code
Feb 08, 2024
Viaarxiv icon

Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models

Add code
Dec 22, 2023
Figure 1 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 2 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 3 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Figure 4 for Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models
Viaarxiv icon

Improving Large Language Model Fine-tuning for Solving Math Problems

Add code
Oct 16, 2023
Figure 1 for Improving Large Language Model Fine-tuning for Solving Math Problems
Figure 2 for Improving Large Language Model Fine-tuning for Solving Math Problems
Figure 3 for Improving Large Language Model Fine-tuning for Solving Math Problems
Figure 4 for Improving Large Language Model Fine-tuning for Solving Math Problems
Viaarxiv icon

Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research

Add code
Oct 12, 2023
Figure 1 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 2 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 3 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Figure 4 for Waymax: An Accelerated, Data-Driven Simulator for Large-Scale Autonomous Driving Research
Viaarxiv icon

Small-scale proxies for large-scale Transformer training instabilities

Add code
Sep 25, 2023
Figure 1 for Small-scale proxies for large-scale Transformer training instabilities
Figure 2 for Small-scale proxies for large-scale Transformer training instabilities
Figure 3 for Small-scale proxies for large-scale Transformer training instabilities
Figure 4 for Small-scale proxies for large-scale Transformer training instabilities
Viaarxiv icon

Multi-objective evolution for Generalizable Policy Gradient Algorithms

Add code
Apr 08, 2022
Figure 1 for Multi-objective evolution for Generalizable Policy Gradient Algorithms
Figure 2 for Multi-objective evolution for Generalizable Policy Gradient Algorithms
Figure 3 for Multi-objective evolution for Generalizable Policy Gradient Algorithms
Figure 4 for Multi-objective evolution for Generalizable Policy Gradient Algorithms
Viaarxiv icon

Information is Power: Intrinsic Control via Information Capture

Add code
Dec 07, 2021
Figure 1 for Information is Power: Intrinsic Control via Information Capture
Figure 2 for Information is Power: Intrinsic Control via Information Capture
Figure 3 for Information is Power: Intrinsic Control via Information Capture
Figure 4 for Information is Power: Intrinsic Control via Information Capture
Viaarxiv icon

Evolving Reinforcement Learning Algorithms

Add code
Jan 08, 2021
Figure 1 for Evolving Reinforcement Learning Algorithms
Figure 2 for Evolving Reinforcement Learning Algorithms
Figure 3 for Evolving Reinforcement Learning Algorithms
Figure 4 for Evolving Reinforcement Learning Algorithms
Viaarxiv icon

Ecological Reinforcement Learning

Add code
Jun 22, 2020
Figure 1 for Ecological Reinforcement Learning
Figure 2 for Ecological Reinforcement Learning
Figure 3 for Ecological Reinforcement Learning
Figure 4 for Ecological Reinforcement Learning
Viaarxiv icon