Bhavya Vasudeva

Understanding Contextual Recall in Transformers: How Finetuning Enables In-Context Reasoning over Pretraining Knowledge

Mar 21, 2026

How Muon's Spectral Design Benefits Generalization: A Study on Imbalanced Data

Oct 27, 2025

In-Context Occam's Razor: How Transformers Prefer Simpler Hypotheses on the Fly

Jun 24, 2025

The Rich and the Simple: On the Implicit Bias of Adam and SGD

May 29, 2025

Simplicity Bias of Transformers to Learn Low Sensitivity Functions

Mar 11, 2024

Implicit Bias and Fast Convergence Rates for Self-attention

Feb 08, 2024

Mitigating Simplicity Bias in Deep Learning for Improved OOD Generalization and Robustness

Oct 09, 2023

LoOp: Looking for Optimal Hard Negative Embeddings for Deep Metric Learning

Aug 20, 2021

Multi-Phase Locking Value: A Generalized Method for Determining Instantaneous Multi-frequency Phase Coupling

Feb 20, 2021

AIM 2020 Challenge on Learned Image Signal Processing Pipeline

Nov 10, 2020