Sridhar Thiagarajan

Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models

Dec 18, 2024
Via arXiv

Finetuning Language Models to Emit Linguistic Expressions of Uncertainty

Sep 18, 2024
Via arXiv

Sample Efficient Deep Reinforcement Learning via Local Planning

Jan 29, 2023
Via arXiv