Alert button
Picture for Jason D. Lee

Jason D. Lee

Alert button

Dataset Reset Policy Optimization for RLHF

Add code
Bookmark button
Alert button
Apr 16, 2024
Jonathan D. Chang, Wenhao Zhan, Owen Oertell, Kianté Brantley, Dipendra Misra, Jason D. Lee, Wen Sun

Viaarxiv icon

Horizon-Free Regret for Linear Markov Decision Processes

Add code
Bookmark button
Alert button
Mar 15, 2024
Zihan Zhang, Jason D. Lee, Yuxin Chen, Simon S. Du

Viaarxiv icon

Computational-Statistical Gaps in Gaussian Single-Index Models

Add code
Bookmark button
Alert button
Mar 12, 2024
Alex Damian, Loucas Pillaud-Vivien, Jason D. Lee, Joan Bruna

Figure 1 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 2 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 3 for Computational-Statistical Gaps in Gaussian Single-Index Models
Figure 4 for Computational-Statistical Gaps in Gaussian Single-Index Models
Viaarxiv icon

The Computational Complexity of Learning Gaussian Single-Index Models

Add code
Bookmark button
Alert button
Mar 08, 2024
Alex Damian, Loucas Pillaud-Vivien, Jason D. Lee, Joan Bruna

Figure 1 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 2 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 3 for The Computational Complexity of Learning Gaussian Single-Index Models
Figure 4 for The Computational Complexity of Learning Gaussian Single-Index Models
Viaarxiv icon

How Well Can Transformers Emulate In-context Newton's Method?

Add code
Bookmark button
Alert button
Mar 05, 2024
Angeliki Giannou, Liu Yang, Tianhao Wang, Dimitris Papailiopoulos, Jason D. Lee

Figure 1 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 2 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 3 for How Well Can Transformers Emulate In-context Newton's Method?
Figure 4 for How Well Can Transformers Emulate In-context Newton's Method?
Viaarxiv icon

BitDelta: Your Fine-Tune May Only Be Worth One Bit

Add code
Bookmark button
Alert button
Feb 28, 2024
James Liu, Guangxuan Xiao, Kai Li, Jason D. Lee, Song Han, Tri Dao, Tianle Cai

Viaarxiv icon

Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark

Add code
Bookmark button
Alert button
Feb 26, 2024
Yihua Zhang, Pingzhi Li, Junyuan Hong, Jiaxiang Li, Yimeng Zhang, Wenqing Zheng, Pin-Yu Chen, Jason D. Lee, Wotao Yin, Mingyi Hong, Zhangyang Wang, Sijia Liu, Tianlong Chen

Viaarxiv icon

How Transformers Learn Causal Structure with Gradient Descent

Add code
Bookmark button
Alert button
Feb 22, 2024
Eshaan Nichani, Alex Damian, Jason D. Lee

Viaarxiv icon

LoRA Training in the NTK Regime has No Spurious Local Minima

Add code
Bookmark button
Alert button
Feb 19, 2024
Uijeong Jang, Jason D. Lee, Ernest K. Ryu

Viaarxiv icon