Alert button
Picture for Tuo Zhao

Tuo Zhao

Alert button

No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models

Add code
Bookmark button
Alert button
Feb 06, 2022
Chen Liang, Haoming Jiang, Simiao Zuo, Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen, Tuo Zhao

Figure 1 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 2 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 3 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Figure 4 for No Parameters Left Behind: Sensitivity Guided Adaptive Learning Rate for Training Large Transformer Models
Viaarxiv icon

Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity

Add code
Bookmark button
Alert button
Jan 30, 2022
Yan Li, Tuo Zhao, Guanghui Lan

Figure 1 for Homotopic Policy Mirror Descent: Policy Convergence, Implicit Regularization, and Improved Sample Complexity
Viaarxiv icon

Block Policy Mirror Descent

Add code
Bookmark button
Alert button
Jan 15, 2022
Guanghui Lan, Yan Li, Tuo Zhao

Figure 1 for Block Policy Mirror Descent
Figure 2 for Block Policy Mirror Descent
Figure 3 for Block Policy Mirror Descent
Figure 4 for Block Policy Mirror Descent
Viaarxiv icon

Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network

Add code
Bookmark button
Alert button
Jan 06, 2022
Siawpeng Er, Edward Liu, Minshuo Chen, Yan Li, Yuqi Liu, Tuo Zhao, Hua Wang

Figure 1 for Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network
Figure 2 for Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network
Figure 3 for Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network
Figure 4 for Deep Learning Assisted End-to-End Synthesis of mm-Wave Passive Networks with 3D EM Structures: A Study on A Transformer-Based Matching Network
Viaarxiv icon

Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces

Add code
Bookmark button
Alert button
Jan 01, 2022
Hao Liu, Haizhao Yang, Minshuo Chen, Tuo Zhao, Wenjing Liao

Figure 1 for Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces
Figure 2 for Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces
Figure 3 for Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces
Figure 4 for Deep Nonparametric Estimation of Operators between Infinite Dimensional Spaces
Viaarxiv icon

Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer

Add code
Bookmark button
Alert button
Dec 20, 2021
Yunhai Han, Rahul Batra, Nathan Boyd, Tuo Zhao, Yu She, Seth Hutchinson, Ye Zhao

Figure 1 for Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Figure 2 for Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Figure 3 for Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Figure 4 for Learning Generalizable Vision-Tactile Robotic Grasping Strategy for Deformable Objects via Transformer
Viaarxiv icon

Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits

Add code
Bookmark button
Alert button
Oct 24, 2021
Yan Li, Dhruv Choudhary, Xiaohan Wei, Baichuan Yuan, Bhargav Bhushanam, Tuo Zhao, Guanghui Lan

Figure 1 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 2 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 3 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Figure 4 for Frequency-aware SGD for Efficient Embedding Learning with Provable Benefits
Viaarxiv icon