Picture for Tim Tsz-Kit Lau

Tim Tsz-Kit Lau

PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective

Add code
May 27, 2025
Viaarxiv icon

Adaptive Batch Size Schedules for Distributed Training of Language Models with Data and Model Parallelism

Add code
Dec 30, 2024
Viaarxiv icon

Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods

Add code
Jun 20, 2024
Figure 1 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 2 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 3 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Figure 4 for Communication-Efficient Adaptive Batch Size Strategies for Distributed Local Gradient Methods
Viaarxiv icon

AdAdaGrad: Adaptive Batch Size Schemes for Adaptive Gradient Methods

Add code
Feb 17, 2024
Viaarxiv icon

Non-Log-Concave and Nonsmooth Sampling via Langevin Monte Carlo Algorithms

Add code
May 25, 2023
Viaarxiv icon

Bregman Proximal Langevin Monte Carlo via Bregman--Moreau Envelopes

Add code
Jul 10, 2022
Figure 1 for Bregman Proximal Langevin Monte Carlo via Bregman--Moreau Envelopes
Figure 2 for Bregman Proximal Langevin Monte Carlo via Bregman--Moreau Envelopes
Figure 3 for Bregman Proximal Langevin Monte Carlo via Bregman--Moreau Envelopes
Figure 4 for Bregman Proximal Langevin Monte Carlo via Bregman--Moreau Envelopes
Viaarxiv icon

Wasserstein Distributionally Robust Optimization via Wasserstein Barycenters

Add code
Mar 23, 2022
Figure 1 for Wasserstein Distributionally Robust Optimization via Wasserstein Barycenters
Figure 2 for Wasserstein Distributionally Robust Optimization via Wasserstein Barycenters
Viaarxiv icon

The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications

Add code
Mar 14, 2022
Figure 1 for The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications
Figure 2 for The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications
Figure 3 for The Multi-Agent Pickup and Delivery Problem: MAPF, MARL and Its Warehouse Applications
Viaarxiv icon

Global Convergence in Deep Learning with Variable Splitting via the Kurdyka-Łojasiewicz Property

Add code
Jun 11, 2018
Figure 1 for Global Convergence in Deep Learning with Variable Splitting via the Kurdyka-Łojasiewicz Property
Viaarxiv icon

A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training

Add code
Mar 24, 2018
Figure 1 for A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training
Figure 2 for A Proximal Block Coordinate Descent Algorithm for Deep Neural Network Training
Viaarxiv icon