Picture for Meisam Razaviyayn

Meisam Razaviyayn

It's All Connected: A Journey Through Test-Time Memorization, Attentional Bias, Retention, and Online Optimization

Add code
Apr 17, 2025
Viaarxiv icon

Synthetic Text Generation for Training Large Language Models via Gradient Matching

Add code
Feb 24, 2025
Viaarxiv icon

PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts

Add code
Feb 10, 2025
Viaarxiv icon

Stochastic Control for Fine-tuning Diffusion Models: Optimality, Regularity, and Convergence

Add code
Dec 24, 2024
Viaarxiv icon

A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data

Add code
Nov 12, 2024
Figure 1 for A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data
Figure 2 for A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data
Figure 3 for A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data
Figure 4 for A Stochastic Optimization Framework for Private and Fair Learning From Decentralized Data
Viaarxiv icon

Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models

Add code
Oct 09, 2024
Figure 1 for Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
Figure 2 for Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
Figure 3 for Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
Figure 4 for Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models
Viaarxiv icon

DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction

Add code
Oct 04, 2024
Figure 1 for DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction
Figure 2 for DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction
Figure 3 for DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction
Figure 4 for DiSK: Differentially Private Optimizer with Simplified Kalman Filter for Noise Reduction
Viaarxiv icon

Adaptively Private Next-Token Prediction of Large Language Models

Add code
Oct 02, 2024
Viaarxiv icon

DOPPLER: Differentially Private Optimizers with Low-pass Filter for Privacy Noise Reduction

Add code
Aug 24, 2024
Viaarxiv icon

Differentially Private Next-Token Prediction of Large Language Models

Add code
Apr 01, 2024
Figure 1 for Differentially Private Next-Token Prediction of Large Language Models
Figure 2 for Differentially Private Next-Token Prediction of Large Language Models
Figure 3 for Differentially Private Next-Token Prediction of Large Language Models
Figure 4 for Differentially Private Next-Token Prediction of Large Language Models
Viaarxiv icon