Picture for Gregory Wornell

Gregory Wornell

GSRM: Generative Speech Reward Model for Speech RLHF

Add code
Feb 14, 2026
Viaarxiv icon

Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering

Add code
May 29, 2025
Figure 1 for Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering
Figure 2 for Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering
Figure 3 for Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering
Figure 4 for Satori-SWE: Evolutionary Test-Time Scaling for Sample-Efficient Software Engineering
Viaarxiv icon

Score-of-Mixture Training: Training One-Step Generative Models Made Simple

Add code
Feb 13, 2025
Figure 1 for Score-of-Mixture Training: Training One-Step Generative Models Made Simple
Figure 2 for Score-of-Mixture Training: Training One-Step Generative Models Made Simple
Figure 3 for Score-of-Mixture Training: Training One-Step Generative Models Made Simple
Figure 4 for Score-of-Mixture Training: Training One-Step Generative Models Made Simple
Viaarxiv icon

Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search

Add code
Feb 04, 2025
Figure 1 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 2 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 3 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Figure 4 for Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search
Viaarxiv icon

Reliable Gradient-free and Likelihood-free Prompt Tuning

Add code
Apr 30, 2023
Figure 1 for Reliable Gradient-free and Likelihood-free Prompt Tuning
Figure 2 for Reliable Gradient-free and Likelihood-free Prompt Tuning
Figure 3 for Reliable Gradient-free and Likelihood-free Prompt Tuning
Figure 4 for Reliable Gradient-free and Likelihood-free Prompt Tuning
Viaarxiv icon

On the Generalization Error of Meta Learning for the Gibbs Algorithm

Add code
Apr 27, 2023
Figure 1 for On the Generalization Error of Meta Learning for the Gibbs Algorithm
Figure 2 for On the Generalization Error of Meta Learning for the Gibbs Algorithm
Viaarxiv icon

Post-hoc Uncertainty Learning using a Dirichlet Meta-Model

Add code
Dec 14, 2022
Figure 1 for Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
Figure 2 for Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
Figure 3 for Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
Figure 4 for Post-hoc Uncertainty Learning using a Dirichlet Meta-Model
Viaarxiv icon

Tighter Expected Generalization Error Bounds via Convexity of Information Measures

Add code
Feb 24, 2022
Figure 1 for Tighter Expected Generalization Error Bounds via Convexity of Information Measures
Viaarxiv icon

On the Benefits of Selectivity in Pseudo-Labeling for Unsupervised Multi-Source-Free Domain Adaptation

Add code
Feb 16, 2022
Figure 1 for On the Benefits of Selectivity in Pseudo-Labeling for Unsupervised Multi-Source-Free Domain Adaptation
Figure 2 for On the Benefits of Selectivity in Pseudo-Labeling for Unsupervised Multi-Source-Free Domain Adaptation
Figure 3 for On the Benefits of Selectivity in Pseudo-Labeling for Unsupervised Multi-Source-Free Domain Adaptation
Figure 4 for On the Benefits of Selectivity in Pseudo-Labeling for Unsupervised Multi-Source-Free Domain Adaptation
Viaarxiv icon

Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm

Add code
Nov 02, 2021
Figure 1 for Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm
Figure 2 for Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm
Figure 3 for Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm
Figure 4 for Characterizing and Understanding the Generalization Error of Transfer Learning with Gibbs Algorithm
Viaarxiv icon