Alert button
Picture for Yuandong Tian

Yuandong Tian

Alert button

Learning Personalized Story Evaluation

Add code
Bookmark button
Alert button
Oct 06, 2023
Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian

Viaarxiv icon

GenCO: Generating Diverse Solutions to Design Problems with Combinatorial Nature

Add code
Bookmark button
Alert button
Oct 03, 2023
Aaron Ferber, Arman Zharmagambetov, Taoan Huang, Bistra Dilkina, Yuandong Tian

Viaarxiv icon

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention

Add code
Bookmark button
Alert button
Oct 03, 2023
Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Du

Viaarxiv icon

Efficient Streaming Language Models with Attention Sinks

Add code
Bookmark button
Alert button
Sep 29, 2023
Guangxuan Xiao, Yuandong Tian, Beidi Chen, Song Han, Mike Lewis

Figure 1 for Efficient Streaming Language Models with Attention Sinks
Figure 2 for Efficient Streaming Language Models with Attention Sinks
Figure 3 for Efficient Streaming Language Models with Attention Sinks
Figure 4 for Efficient Streaming Language Models with Attention Sinks
Viaarxiv icon

RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment

Add code
Bookmark button
Alert button
Jul 24, 2023
Kevin Yang, Dan Klein, Asli Celikyilmaz, Nanyun Peng, Yuandong Tian

Figure 1 for RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment
Figure 2 for RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment
Figure 3 for RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment
Figure 4 for RLCD: Reinforcement Learning from Contrast Distillation for Language Model Alignment
Viaarxiv icon

H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Add code
Bookmark button
Alert button
Jul 19, 2023
Zhenyu Zhang, Ying Sheng, Tianyi Zhou, Tianlong Chen, Lianmin Zheng, Ruisi Cai, Zhao Song, Yuandong Tian, Christopher Ré, Clark Barrett, Zhangyang Wang, Beidi Chen

Figure 1 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 2 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 3 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Figure 4 for H$_2$O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models
Viaarxiv icon

Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information

Add code
Bookmark button
Alert button
Jul 18, 2023
Arman Zharmagambetov, Brandon Amos, Aaron Ferber, Taoan Huang, Bistra Dilkina, Yuandong Tian

Figure 1 for Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information
Figure 2 for Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information
Figure 3 for Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information
Figure 4 for Landscape Surrogate: Learning Decision Losses for Mathematical Optimization Under Partial Information
Viaarxiv icon

Extending Context Window of Large Language Models via Positional Interpolation

Add code
Bookmark button
Alert button
Jun 28, 2023
Shouyuan Chen, Sherman Wong, Liangjian Chen, Yuandong Tian

Figure 1 for Extending Context Window of Large Language Models via Positional Interpolation
Figure 2 for Extending Context Window of Large Language Models via Positional Interpolation
Figure 3 for Extending Context Window of Large Language Models via Positional Interpolation
Figure 4 for Extending Context Window of Large Language Models via Positional Interpolation
Viaarxiv icon

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer

Add code
Bookmark button
Alert button
May 25, 2023
Yuandong Tian, Yiping Wang, Beidi Chen, Simon Du

Figure 1 for Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Figure 2 for Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Figure 3 for Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Figure 4 for Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer
Viaarxiv icon