Alert button
Picture for Parameswaran Raman

Parameswaran Raman

Alert button

HLAT: High-quality Large Language Model Pre-trained on AWS Trainium

Add code
Bookmark button
Alert button
Apr 16, 2024
Haozheng Fan, Hao Zhou, Guangtai Huang, Parameswaran Raman, Xinwei Fu, Gaurav Gupta, Dhananjay Ram, Yida Wang, Jun Huan

Viaarxiv icon

EMC$^2$: Efficient MCMC Negative Sampling for Contrastive Learning with Global Convergence

Add code
Bookmark button
Alert button
Apr 16, 2024
Chung-Yiu Yau, Hoi-To Wai, Parameswaran Raman, Soumajyoti Sarkar, Mingyi Hong

Viaarxiv icon

Variance-reduced Zeroth-Order Methods for Fine-Tuning Language Models

Add code
Bookmark button
Alert button
Apr 11, 2024
Tanmay Gautam, Youngsuk Park, Hao Zhou, Parameswaran Raman, Wooseok Ha

Viaarxiv icon

MADA: Meta-Adaptive Optimizers through hyper-gradient Descent

Add code
Bookmark button
Alert button
Jan 17, 2024
Kaan Ozkara, Can Karakus, Parameswaran Raman, Mingyi Hong, Shoham Sabach, Branislav Kveton, Volkan Cevher

Viaarxiv icon

Krylov Cubic Regularized Newton: A Subspace Second-Order Method with Dimension-Free Convergence Rate

Add code
Bookmark button
Alert button
Jan 05, 2024
Ruichen Jiang, Parameswaran Raman, Shoham Sabach, Aryan Mokhtari, Mingyi Hong, Volkan Cevher

Viaarxiv icon

Contractive error feedback for gradient compression

Add code
Bookmark button
Alert button
Dec 13, 2023
Bingcong Li, Shuai Zheng, Parameswaran Raman, Anshumali Shrivastava, Georgios B. Giannakis

Viaarxiv icon

DS-FACTO: Doubly Separable Factorization Machines

Add code
Bookmark button
Alert button
Apr 29, 2020
Parameswaran Raman, S. V. N. Vishwanathan

Figure 1 for DS-FACTO: Doubly Separable Factorization Machines
Figure 2 for DS-FACTO: Doubly Separable Factorization Machines
Figure 3 for DS-FACTO: Doubly Separable Factorization Machines
Figure 4 for DS-FACTO: Doubly Separable Factorization Machines
Viaarxiv icon

Optimization on the Surface of the (Hyper)-Sphere

Add code
Bookmark button
Alert button
Sep 13, 2019
Parameswaran Raman, Jiasen Yang

Figure 1 for Optimization on the Surface of the (Hyper)-Sphere
Figure 2 for Optimization on the Surface of the (Hyper)-Sphere
Figure 3 for Optimization on the Surface of the (Hyper)-Sphere
Figure 4 for Optimization on the Surface of the (Hyper)-Sphere
Viaarxiv icon

DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression

Add code
Bookmark button
Alert button
Aug 03, 2018
Parameswaran Raman, Sriram Srinivasan, Shin Matsushima, Xinhua Zhang, Hyokun Yun, S. V. N. Vishwanathan

Figure 1 for DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression
Figure 2 for DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression
Figure 3 for DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression
Figure 4 for DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression
Viaarxiv icon

Extreme Stochastic Variational Inference: Distributed and Asynchronous

Add code
Bookmark button
Alert button
Aug 03, 2018
Jiong Zhang, Parameswaran Raman, Shihao Ji, Hsiang-Fu Yu, S. V. N. Vishwanathan, Inderjit S. Dhillon

Figure 1 for Extreme Stochastic Variational Inference: Distributed and Asynchronous
Figure 2 for Extreme Stochastic Variational Inference: Distributed and Asynchronous
Figure 3 for Extreme Stochastic Variational Inference: Distributed and Asynchronous
Figure 4 for Extreme Stochastic Variational Inference: Distributed and Asynchronous
Viaarxiv icon