Picture for Dhruv Madeka

Dhruv Madeka

A Study on the Calibration of In-context Learning

Add code
Dec 11, 2023
Figure 1 for A Study on the Calibration of In-context Learning
Figure 2 for A Study on the Calibration of In-context Learning
Figure 3 for A Study on the Calibration of In-context Learning
Figure 4 for A Study on the Calibration of In-context Learning
Viaarxiv icon

Learning an Inventory Control Policy with General Inventory Arrival Dynamics

Oct 26, 2023
Figure 1 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 2 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 3 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Figure 4 for Learning an Inventory Control Policy with General Inventory Arrival Dynamics
Viaarxiv icon

Contextual Bandits for Evaluating and Improving Inventory Control Policies

Oct 24, 2023
Figure 1 for Contextual Bandits for Evaluating and Improving Inventory Control Policies
Viaarxiv icon

Scaling Laws for Imitation Learning in NetHack

Add code
Jul 18, 2023
Figure 1 for Scaling Laws for Imitation Learning in NetHack
Figure 2 for Scaling Laws for Imitation Learning in NetHack
Figure 3 for Scaling Laws for Imitation Learning in NetHack
Figure 4 for Scaling Laws for Imitation Learning in NetHack
Viaarxiv icon

Linear Reinforcement Learning with Ball Structure Action Space

Nov 14, 2022
Viaarxiv icon

Deep Inventory Management

Oct 06, 2022
Figure 1 for Deep Inventory Management
Figure 2 for Deep Inventory Management
Figure 3 for Deep Inventory Management
Figure 4 for Deep Inventory Management
Viaarxiv icon

MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation

Jul 21, 2022
Figure 1 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 2 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 3 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Figure 4 for MQRetNN: Multi-Horizon Time Series Forecasting with Retrieval Augmentation
Viaarxiv icon

A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation

Jul 18, 2022
Figure 1 for A Few Expert Queries Suffices for Sample-Efficient RL with Resets and Linear Value Approximation
Viaarxiv icon

Assessment of Treatment Effect Estimators for Heavy-Tailed Data

Dec 19, 2021
Figure 1 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 2 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 3 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Figure 4 for Assessment of Treatment Effect Estimators for Heavy-Tailed Data
Viaarxiv icon

MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention

Add code
Oct 07, 2020
Figure 1 for MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Figure 2 for MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Figure 3 for MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Figure 4 for MQTransformer: Multi-Horizon Forecasts with Context Dependent and Feedback-Aware Attention
Viaarxiv icon