Alert button
Picture for Tengyu Ma

Tengyu Ma

Alert button

DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization

Add code
Bookmark button
Alert button
Dec 09, 2021
Aviral Kumar, Rishabh Agarwal, Tengyu Ma, Aaron Courville, George Tucker, Sergey Levine

Figure 1 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 2 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 3 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Figure 4 for DR3: Value-Based Deep Reinforcement Learning Requires Explicit Regularization
Viaarxiv icon

Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification

Add code
Bookmark button
Alert button
Nov 22, 2021
Ling Pan, Longbo Huang, Tengyu Ma, Huazhe Xu

Figure 1 for Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
Figure 2 for Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
Figure 3 for Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
Figure 4 for Plan Better Amid Conservatism: Offline Multi-Agent Reinforcement Learning with Actor Rectification
Viaarxiv icon

An Explanation of In-context Learning as Implicit Bayesian Inference

Add code
Bookmark button
Alert button
Nov 14, 2021
Sang Michael Xie, Aditi Raghunathan, Percy Liang, Tengyu Ma

Figure 1 for An Explanation of In-context Learning as Implicit Bayesian Inference
Figure 2 for An Explanation of In-context Learning as Implicit Bayesian Inference
Figure 3 for An Explanation of In-context Learning as Implicit Bayesian Inference
Figure 4 for An Explanation of In-context Learning as Implicit Bayesian Inference
Viaarxiv icon

Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective

Add code
Bookmark button
Alert button
Nov 05, 2021
Margalit Glasgow, Honglin Yuan, Tengyu Ma

Figure 1 for Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective
Figure 2 for Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective
Figure 3 for Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective
Figure 4 for Sharp Bounds for Federated Averaging (Local SGD) and Continuous Perspective
Viaarxiv icon

Self-supervised Learning is More Robust to Dataset Imbalance

Add code
Bookmark button
Alert button
Oct 11, 2021
Hong Liu, Jeff Z. HaoChen, Adrien Gaidon, Tengyu Ma

Figure 1 for Self-supervised Learning is More Robust to Dataset Imbalance
Figure 2 for Self-supervised Learning is More Robust to Dataset Imbalance
Figure 3 for Self-supervised Learning is More Robust to Dataset Imbalance
Figure 4 for Self-supervised Learning is More Robust to Dataset Imbalance
Viaarxiv icon

On the Opportunities and Risks of Foundation Models

Add code
Bookmark button
Alert button
Aug 18, 2021
Rishi Bommasani, Drew A. Hudson, Ehsan Adeli, Russ Altman, Simran Arora, Sydney von Arx, Michael S. Bernstein, Jeannette Bohg, Antoine Bosselut, Emma Brunskill, Erik Brynjolfsson, Shyamal Buch, Dallas Card, Rodrigo Castellon, Niladri Chatterji, Annie Chen, Kathleen Creel, Jared Quincy Davis, Dora Demszky, Chris Donahue, Moussa Doumbouya, Esin Durmus, Stefano Ermon, John Etchemendy, Kawin Ethayarajh, Li Fei-Fei, Chelsea Finn, Trevor Gale, Lauren Gillespie, Karan Goel, Noah Goodman, Shelby Grossman, Neel Guha, Tatsunori Hashimoto, Peter Henderson, John Hewitt, Daniel E. Ho, Jenny Hong, Kyle Hsu, Jing Huang, Thomas Icard, Saahil Jain, Dan Jurafsky, Pratyusha Kalluri, Siddharth Karamcheti, Geoff Keeling, Fereshte Khani, Omar Khattab, Pang Wei Kohd, Mark Krass, Ranjay Krishna, Rohith Kuditipudi, Ananya Kumar, Faisal Ladhak, Mina Lee, Tony Lee, Jure Leskovec, Isabelle Levent, Xiang Lisa Li, Xuechen Li, Tengyu Ma, Ali Malik, Christopher D. Manning, Suvir Mirchandani, Eric Mitchell, Zanele Munyikwa, Suraj Nair, Avanika Narayan, Deepak Narayanan, Ben Newman, Allen Nie, Juan Carlos Niebles, Hamed Nilforoshan, Julian Nyarko, Giray Ogut, Laurel Orr, Isabel Papadimitriou, Joon Sung Park, Chris Piech, Eva Portelance, Christopher Potts, Aditi Raghunathan, Rob Reich, Hongyu Ren, Frieda Rong, Yusuf Roohani, Camilo Ruiz, Jack Ryan, Christopher Ré, Dorsa Sadigh, Shiori Sagawa, Keshav Santhanam, Andy Shih, Krishnan Srinivasan, Alex Tamkin, Rohan Taori, Armin W. Thomas, Florian Tramèr, Rose E. Wang, William Wang, Bohan Wu, Jiajun Wu, Yuhuai Wu, Sang Michael Xie, Michihiro Yasunaga, Jiaxuan You, Matei Zaharia, Michael Zhang, Tianyi Zhang, Xikun Zhang, Yuhui Zhang, Lucia Zheng, Kaitlyn Zhou, Percy Liang

Figure 1 for On the Opportunities and Risks of Foundation Models
Figure 2 for On the Opportunities and Risks of Foundation Models
Figure 3 for On the Opportunities and Risks of Foundation Models
Figure 4 for On the Opportunities and Risks of Foundation Models
Viaarxiv icon

Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations

Add code
Bookmark button
Alert button
Aug 04, 2021
Yuping Luo, Tengyu Ma

Figure 1 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Figure 2 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Figure 3 for Learning Barrier Certificates: Towards Safe Reinforcement Learning with Zero Training-time Violations
Viaarxiv icon

Statistically Meaningful Approximation: a Case Study on Approximating Turing Machines with Transformers

Add code
Bookmark button
Alert button
Jul 28, 2021
Colin Wei, Yining Chen, Tengyu Ma

Viaarxiv icon