Picture for Michael W. Mahoney

Michael W. Mahoney

UC Berkeley/LBNL/ICSI

AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models

Add code
Oct 14, 2024
Figure 1 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 2 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 3 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Figure 4 for AlphaPruning: Using Heavy-Tailed Self Regularization Theory for Improved Layer-wise Pruning of Large Language Models
Viaarxiv icon

Elucidating the Design Choice of Probability Paths in Flow Matching for Forecasting

Add code
Oct 04, 2024
Viaarxiv icon

Mitigating Memorization In Language Models

Add code
Oct 03, 2024
Figure 1 for Mitigating Memorization In Language Models
Figure 2 for Mitigating Memorization In Language Models
Figure 3 for Mitigating Memorization In Language Models
Figure 4 for Mitigating Memorization In Language Models
Viaarxiv icon

Tuning Frequency Bias of State Space Models

Add code
Oct 02, 2024
Figure 1 for Tuning Frequency Bias of State Space Models
Figure 2 for Tuning Frequency Bias of State Space Models
Figure 3 for Tuning Frequency Bias of State Space Models
Figure 4 for Tuning Frequency Bias of State Space Models
Viaarxiv icon

Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling

Add code
Jul 21, 2024
Figure 1 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 2 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 3 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Figure 4 for Learning Physics for Unveiling Hidden Earthquake Ground Motions via Conditional Generative Modeling
Viaarxiv icon

Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics

Add code
Jul 19, 2024
Figure 1 for Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics
Figure 2 for Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics
Figure 3 for Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics
Figure 4 for Comparing and Contrasting Deep Learning Weather Prediction Backbones on Navier-Stokes and Atmospheric Dynamics
Viaarxiv icon

Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance

Add code
Jul 17, 2024
Figure 1 for Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Figure 2 for Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Figure 3 for Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Figure 4 for Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance
Viaarxiv icon

Reliable edge machine learning hardware for scientific applications

Add code
Jun 27, 2024
Figure 1 for Reliable edge machine learning hardware for scientific applications
Figure 2 for Reliable edge machine learning hardware for scientific applications
Figure 3 for Reliable edge machine learning hardware for scientific applications
Figure 4 for Reliable edge machine learning hardware for scientific applications
Viaarxiv icon

Recent and Upcoming Developments in Randomized Numerical Linear Algebra for Machine Learning

Add code
Jun 17, 2024
Figure 1 for Recent and Upcoming Developments in Randomized Numerical Linear Algebra for Machine Learning
Figure 2 for Recent and Upcoming Developments in Randomized Numerical Linear Algebra for Machine Learning
Figure 3 for Recent and Upcoming Developments in Randomized Numerical Linear Algebra for Machine Learning
Viaarxiv icon

Towards Scalable and Versatile Weight Space Learning

Add code
Jun 14, 2024
Viaarxiv icon