Thalaiyasingam Ajanthan

NuMuon: Nuclear-Norm-Constrained Muon for Compressible LLM Training

Mar 04, 2026
Via arXiv

SENTINEL: Stagewise Integrity Verification for Pipeline Parallel Decentralized Training

Mar 03, 2026
Via arXiv

AsyncMesh: Fully Asynchronous Optimization for Data and Pipeline Parallelism

Jan 30, 2026
Via arXiv

Sharper Convergence Rates for Nonconvex Optimisation via Reduction Mappings

Jun 10, 2025
Via arXiv

Nesterov Method for Asynchronous Pipeline Parallel Optimization

May 02, 2025
Via arXiv

Learning Visual Hierarchies with Hyperbolic Embeddings

Nov 26, 2024
Via arXiv

Guiding Neural Collapse: Optimising Towards the Nearest Simplex Equiangular Tight Frame

Nov 02, 2024
Via arXiv

Self-Supervision Improves Diffusion Models for Tabular Data Imputation

Jul 25, 2024
Via arXiv

Adaptive Cross Batch Normalization for Metric Learning

Mar 30, 2023
Via arXiv

Understanding and Improving the Role of Projection Head in Self-Supervised Learning

Dec 22, 2022
Via arXiv