Alert button
Picture for Yuandong Tian

Yuandong Tian

Alert button

GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection

Add code
Bookmark button
Alert button
Mar 06, 2024
Jiawei Zhao, Zhenyu Zhang, Beidi Chen, Zhangyang Wang, Anima Anandkumar, Yuandong Tian

Figure 1 for GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Figure 2 for GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Figure 3 for GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Figure 4 for GaLore: Memory-Efficient LLM Training by Gradient Low-Rank Projection
Viaarxiv icon

MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases

Add code
Bookmark button
Alert button
Feb 22, 2024
Zechun Liu, Changsheng Zhao, Forrest Iandola, Chen Lai, Yuandong Tian, Igor Fedorov, Yunyang Xiong, Ernie Chang, Yangyang Shi, Raghuraman Krishnamoorthi, Liangzhen Lai, Vikas Chandra

Viaarxiv icon

Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping

Add code
Bookmark button
Alert button
Feb 21, 2024
Lucas Lehnert, Sainbayar Sukhbaatar, Paul Mcvay, Michael Rabbat, Yuandong Tian

Viaarxiv icon

Diffusion World Model

Add code
Bookmark button
Alert button
Feb 11, 2024
Zihan Ding, Amy Zhang, Yuandong Tian, Qinqing Zheng

Viaarxiv icon

TravelPlanner: A Benchmark for Real-World Planning with Language Agents

Add code
Bookmark button
Alert button
Feb 05, 2024
Jian Xie, Kai Zhang, Jiangjie Chen, Tinghui Zhu, Renze Lou, Yuandong Tian, Yanghua Xiao, Yu Su

Viaarxiv icon

H-GAP: Humanoid Control with a Generalist Planner

Add code
Bookmark button
Alert button
Dec 05, 2023
Zhengyao Jiang, Yingchen Xu, Nolan Wagener, Yicheng Luo, Michael Janner, Edward Grefenstette, Tim Rocktäschel, Yuandong Tian

Viaarxiv icon

Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time

Add code
Bookmark button
Alert button
Oct 26, 2023
Zichang Liu, Jue Wang, Tri Dao, Tianyi Zhou, Binhang Yuan, Zhao Song, Anshumali Shrivastava, Ce Zhang, Yuandong Tian, Christopher Re, Beidi Chen

Figure 1 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 2 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 3 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Figure 4 for Deja Vu: Contextual Sparsity for Efficient LLMs at Inference Time
Viaarxiv icon

End-to-end Story Plot Generator

Add code
Bookmark button
Alert button
Oct 13, 2023
Hanlin Zhu, Andrew Cohen, Danqing Wang, Kevin Yang, Xiaomeng Yang, Jiantao Jiao, Yuandong Tian

Viaarxiv icon

Learning Personalized Story Evaluation

Add code
Bookmark button
Alert button
Oct 10, 2023
Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian

Viaarxiv icon