Picture for Mladen Kolar

Mladen Kolar

SMART: A Spectral Transfer Approach to Multi-Task Learning

Add code
Apr 22, 2026
Viaarxiv icon

Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism

Add code
Dec 30, 2024
Figure 1 for Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Figure 2 for Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Figure 3 for Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Figure 4 for Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism
Viaarxiv icon

High-Dimensional Markov-switching Ordinary Differential Processes

Add code
Dec 30, 2024
Viaarxiv icon

Trans-Glasso: A Transfer Learning Approach to Precision Matrix Estimation

Add code
Nov 23, 2024
Viaarxiv icon

High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching

Add code
Oct 14, 2024
Figure 1 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
Figure 2 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
Figure 3 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
Figure 4 for High-Dimensional Differential Parameter Inference in Exponential Family using Time Score Matching
Viaarxiv icon

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Add code
Jul 10, 2024
Figure 1 for Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning
Viaarxiv icon

Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods

Add code
Jun 20, 2024
Figure 1 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 2 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 3 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 4 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Viaarxiv icon

Personalized Binomial DAGs Learning with Network Structured Covariates

Add code
Jun 10, 2024
Figure 1 for Personalized Binomial DAGs Learning with Network Structured Covariates
Figure 2 for Personalized Binomial DAGs Learning with Network Structured Covariates
Figure 3 for Personalized Binomial DAGs Learning with Network Structured Covariates
Figure 4 for Personalized Binomial DAGs Learning with Network Structured Covariates
Viaarxiv icon

AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods

Add code
Feb 17, 2024
Viaarxiv icon

Inconsistency of cross-validation for structure learning in Gaussian graphical models

Add code
Dec 28, 2023
Viaarxiv icon