Alert button
Picture for Rong Ge

Rong Ge

Alert button

Do Transformers Parse while Predicting the Masked Word?

Add code
Bookmark button
Alert button
Mar 14, 2023
Haoyu Zhao, Abhishek Panigrahi, Rong Ge, Sanjeev Arora

Figure 1 for Do Transformers Parse while Predicting the Masked Word?
Figure 2 for Do Transformers Parse while Predicting the Masked Word?
Figure 3 for Do Transformers Parse while Predicting the Masked Word?
Figure 4 for Do Transformers Parse while Predicting the Masked Word?
Viaarxiv icon

Hiding Data Helps: On the Benefits of Masking for Sparse Coding

Add code
Bookmark button
Alert button
Feb 24, 2023
Muthu Chidambaram, Chenwei Wu, Yu Cheng, Rong Ge

Figure 1 for Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Figure 2 for Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Figure 3 for Hiding Data Helps: On the Benefits of Masking for Sparse Coding
Viaarxiv icon

Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression

Add code
Bookmark button
Alert button
Feb 01, 2023
Mo Zhou, Rong Ge

Figure 1 for Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression
Figure 2 for Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression
Viaarxiv icon

Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup

Add code
Bookmark button
Alert button
Oct 24, 2022
Muthu Chidambaram, Xiang Wang, Chenwei Wu, Rong Ge

Figure 1 for Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup
Figure 2 for Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup
Figure 3 for Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup
Figure 4 for Provably Learning Diverse Features in Multi-View Data with Midpoint Mixup
Viaarxiv icon

Understanding Edge-of-Stability Training Dynamics with a Minimalist Example

Add code
Bookmark button
Alert button
Oct 07, 2022
Xingyu Zhu, Zixuan Wang, Xiang Wang, Mo Zhou, Rong Ge

Figure 1 for Understanding Edge-of-Stability Training Dynamics with a Minimalist Example
Figure 2 for Understanding Edge-of-Stability Training Dynamics with a Minimalist Example
Figure 3 for Understanding Edge-of-Stability Training Dynamics with a Minimalist Example
Figure 4 for Understanding Edge-of-Stability Training Dynamics with a Minimalist Example
Viaarxiv icon

Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks

Add code
Bookmark button
Alert button
Oct 03, 2022
Xiang Wang, Annie N. Wang, Mo Zhou, Rong Ge

Figure 1 for Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
Figure 2 for Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
Figure 3 for Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
Figure 4 for Plateau in Monotonic Linear Interpolation -- A "Biased" View of Loss Landscape for Deep Networks
Viaarxiv icon

A Regression Approach to Learning-Augmented Online Algorithms

Add code
Bookmark button
Alert button
May 25, 2022
Keerti Anand, Rong Ge, Amit Kumar, Debmalya Panigrahi

Figure 1 for A Regression Approach to Learning-Augmented Online Algorithms
Viaarxiv icon

Customizing ML Predictions for Online Algorithms

Add code
Bookmark button
Alert button
May 18, 2022
Keerti Anand, Rong Ge, Debmalya Panigrahi

Figure 1 for Customizing ML Predictions for Online Algorithms
Figure 2 for Customizing ML Predictions for Online Algorithms
Figure 3 for Customizing ML Predictions for Online Algorithms
Viaarxiv icon

Online Algorithms with Multiple Predictions

Add code
Bookmark button
Alert button
May 08, 2022
Keerti Anand, Rong Ge, Amit Kumar, Debmalya Panigrahi

Viaarxiv icon