Picture for Keith Ross

Keith Ross

Is Optimal Transport Necessary for Inverse Reinforcement Learning?

Add code
Jun 07, 2025
Viaarxiv icon

Reinforcement Learning vs. Distillation: Understanding Accuracy and Capability in LLM Reasoning

Add code
May 20, 2025
Viaarxiv icon

Warm Up Before You Train: Unlocking General Reasoning in Resource-Constrained Settings

Add code
May 19, 2025
Viaarxiv icon

Neural Multivariate Regression: Qualitative Insights from the Unconstrained Feature Model

Add code
May 14, 2025
Viaarxiv icon

Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges

Add code
Feb 12, 2025
Viaarxiv icon

Finite-Sample Analysis of the Monte Carlo Exploring Starts Algorithm for Reinforcement Learning

Add code
Oct 03, 2024
Viaarxiv icon

The Prevalence of Neural Collapse in Neural Multivariate Regression

Add code
Sep 06, 2024
Viaarxiv icon

Cross Entropy versus Label Smoothing: A Neural Collapse Perspective

Add code
Feb 07, 2024
Viaarxiv icon

Pre-training with Synthetic Data Helps Offline Reinforcement Learning

Add code
Oct 06, 2023
Viaarxiv icon

On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs

Add code
Sep 07, 2022
Figure 1 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 2 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 3 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Figure 4 for On the Convergence of Monte Carlo UCB for Random-Length Episodic MDPs
Viaarxiv icon