Simon Du

JoMA: Demystifying Multilayer Transformers via JOint Dynamics of MLP and Attention

Oct 03, 2023
Yuandong Tian, Yiping Wang, Zhenyu Zhang, Beidi Chen, Simon Du

Scan and Snap: Understanding Training Dynamics and Token Composition in 1-layer Transformer

May 25, 2023
Yuandong Tian, Yiping Wang, Beidi Chen, Simon Du

Near-Optimal Algorithms for Autonomous Exploration and Multi-Goal Stochastic Shortest Path

May 22, 2022
Haoyuan Cai, Tengyu Ma, Simon Du

AdaLoss: A computationally-efficient and provably convergent adaptive gradient method

Sep 17, 2021
Xiaoxia Wu, Yuege Xie, Simon Du, Rachel Ward

Discovering Diverse Multi-Agent Strategic Behavior via Reward Randomization

Mar 12, 2021
Zhenggang Tang, Chao Yu, Boyuan Chen, Huazhe Xu, Xiaolong Wang, Fei Fang, Simon Du, Yu Wang, Yi Wu

Discrete-Continuous Mixtures in Probabilistic Programming: Generalized Semantics and Inference Algorithms

Jun 08, 2018
Yi Wu, Siddharth Srivastava, Nicholas Hay, Simon Du, Stuart Russell

Stochastic Zeroth-order Optimization in High Dimensions

Feb 26, 2018
Yining Wang, Simon Du, Sivaraman Balakrishnan, Aarti Singh
