Picture for Alexander J. Smola

Alexander J. Smola

Trust the Batch, On- or Off-Policy: Adaptive Policy Optimization for RL Post-Training

Add code
May 12, 2026
Viaarxiv icon

Data drift correction via time-varying importance weight estimator

Add code
Oct 04, 2022
Figure 1 for Data drift correction via time-varying importance weight estimator
Figure 2 for Data drift correction via time-varying importance weight estimator
Figure 3 for Data drift correction via time-varying importance weight estimator
Figure 4 for Data drift correction via time-varying importance weight estimator
Viaarxiv icon

Deep Q-Network with Proximal Iteration

Add code
Dec 10, 2021
Figure 1 for Deep Q-Network with Proximal Iteration
Figure 2 for Deep Q-Network with Proximal Iteration
Figure 3 for Deep Q-Network with Proximal Iteration
Figure 4 for Deep Q-Network with Proximal Iteration
Viaarxiv icon

Benchmarking Multimodal AutoML for Tabular Data with Text Fields

Add code
Nov 04, 2021
Figure 1 for Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Figure 2 for Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Figure 3 for Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Figure 4 for Benchmarking Multimodal AutoML for Tabular Data with Text Fields
Viaarxiv icon

Deep Explicit Duration Switching Models for Time Series

Add code
Oct 26, 2021
Figure 1 for Deep Explicit Duration Switching Models for Time Series
Figure 2 for Deep Explicit Duration Switching Models for Time Series
Figure 3 for Deep Explicit Duration Switching Models for Time Series
Figure 4 for Deep Explicit Duration Switching Models for Time Series
Viaarxiv icon

Dive into Deep Learning

Add code
Jun 21, 2021
Figure 1 for Dive into Deep Learning
Viaarxiv icon

Deep Quantile Aggregation

Add code
Mar 16, 2021
Figure 1 for Deep Quantile Aggregation
Figure 2 for Deep Quantile Aggregation
Figure 3 for Deep Quantile Aggregation
Figure 4 for Deep Quantile Aggregation
Viaarxiv icon

Continuous Doubly Constrained Batch Reinforcement Learning

Add code
Feb 23, 2021
Figure 1 for Continuous Doubly Constrained Batch Reinforcement Learning
Figure 2 for Continuous Doubly Constrained Batch Reinforcement Learning
Figure 3 for Continuous Doubly Constrained Batch Reinforcement Learning
Figure 4 for Continuous Doubly Constrained Batch Reinforcement Learning
Viaarxiv icon

DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning

Add code
Jun 26, 2020
Figure 1 for DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Figure 2 for DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Figure 3 for DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Figure 4 for DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning
Viaarxiv icon

Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation

Add code
Jun 25, 2020
Figure 1 for Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation
Figure 2 for Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation
Figure 3 for Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation
Figure 4 for Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation
Viaarxiv icon