Alert button
Picture for Jason D. Lee

Jason D. Lee

Alert button

Horizon-Free Regret for Linear Markov Decision Processes

Mar 15, 2024
Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon S. Du

Viaarxiv icon

Computational-Statistical Gaps in Gaussian Single-Index Models

Mar 12, 2024
Alex Damian, Loucas Pillaud-Vivien, Jason D. Lee, Joan Bruna

Figure 1 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 2 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 3 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 4 for Computational-Statistical Gaps in Gaussian Single-Index Models
Viaarxiv icon

The Computational Complexity of Learning Gaussian Single-Index Models

Mar 08, 2024
Alex Damian, Loucas Pillaud-Vivien, Jason D. Lee, Joan Bruna

Figure 1 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 2 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 3 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 4 for The Computational Complexity of Learning Gaussian Single-Index Models
Viaarxiv icon

How Well Can Transformers Emulate In-context Newton's Method?

Mar 05, 2024
Angeliki Giannou, Liu Yang, Tianhao Wang, Dimitris Papailiopoulos, Jason D. Lee

Figure 1 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 2 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 3 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 4 for How Well Can Transformers Emulate In-context Newton's Method?
Viaarxiv icon

BitDelta: Your Fine-Tune May Only Be Worth One Bit

Feb 28, 2024
James Liu, Guangxuan Xiao, Kai Li, Jason D. Lee, Song Han, Tri Dao, Tianle Cai

Viaarxiv icon

Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

Feb 26, 2024
Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

Viaarxiv icon

How Transformers Learn Causal Structure with Gradient Descent

Feb 22, 2024
Eshaan Nichani, Alex Damian, Jason D. Lee

Viaarxiv icon

LoRA Training in the NTK Regime has No Spurious Local Minima

Feb 19, 2024
Uijeong Jang, Jason D. Lee, Ernest K. Ryu

Viaarxiv icon

An Information-Theoretic Analysis of In-Context Learning

Jan 28, 2024
Hong Jun Jeon, Jason D. Lee, Qi Lei, Benjamin Van Roy

Viaarxiv icon