Picture for Quoc V. Le

Quoc V. Le

BIG-Bench Extra Hard

Add code
Feb 26, 2025
Viaarxiv icon

Gold-medalist Performance in Solving Olympiad Geometry with AlphaGeometry2

Add code
Feb 05, 2025
Viaarxiv icon

SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

Add code
Jan 28, 2025
Figure 1 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 2 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 3 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Figure 4 for SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training
Viaarxiv icon

Evolving Alignment via Asymmetric Self-Play

Add code
Oct 31, 2024
Figure 1 for Evolving Alignment via Asymmetric Self-Play
Figure 2 for Evolving Alignment via Asymmetric Self-Play
Figure 3 for Evolving Alignment via Asymmetric Self-Play
Figure 4 for Evolving Alignment via Asymmetric Self-Play
Viaarxiv icon

EVOLvE: Evaluating and Optimizing LLMs For Exploration

Add code
Oct 08, 2024
Figure 1 for EVOLvE: Evaluating and Optimizing LLMs For Exploration
Figure 2 for EVOLvE: Evaluating and Optimizing LLMs For Exploration
Figure 3 for EVOLvE: Evaluating and Optimizing LLMs For Exploration
Figure 4 for EVOLvE: Evaluating and Optimizing LLMs For Exploration
Viaarxiv icon

Large Language Monkeys: Scaling Inference Compute with Repeated Sampling

Add code
Jul 31, 2024
Figure 1 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 2 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 3 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Figure 4 for Large Language Monkeys: Scaling Inference Compute with Repeated Sampling
Viaarxiv icon

NATURAL PLAN: Benchmarking LLMs on Natural Language Planning

Add code
Jun 06, 2024
Figure 1 for NATURAL PLAN: Benchmarking LLMs on Natural Language Planning
Figure 2 for NATURAL PLAN: Benchmarking LLMs on Natural Language Planning
Figure 3 for NATURAL PLAN: Benchmarking LLMs on Natural Language Planning
Figure 4 for NATURAL PLAN: Benchmarking LLMs on Natural Language Planning
Viaarxiv icon

Long-form factuality in large language models

Add code
Apr 03, 2024
Figure 1 for Long-form factuality in large language models
Figure 2 for Long-form factuality in large language models
Figure 3 for Long-form factuality in large language models
Figure 4 for Long-form factuality in large language models
Viaarxiv icon

Self-Discover: Large Language Models Self-Compose Reasoning Structures

Add code
Feb 06, 2024
Figure 1 for Self-Discover: Large Language Models Self-Compose Reasoning Structures
Figure 2 for Self-Discover: Large Language Models Self-Compose Reasoning Structures
Figure 3 for Self-Discover: Large Language Models Self-Compose Reasoning Structures
Figure 4 for Self-Discover: Large Language Models Self-Compose Reasoning Structures
Viaarxiv icon

AutoNumerics-Zero: Automated Discovery of State-of-the-Art Mathematical Functions

Add code
Dec 13, 2023
Viaarxiv icon