Fuzhao Xue

Adaptive Computation with Elastic Input Sequence

Jan 30, 2023

Deeper vs Wider: A Revisit of Transformer Configuration

May 24, 2022

CowClip: Reducing CTR Prediction Model Training Time from 12 hours to 10 minutes on 1 GPU

Apr 22, 2022

Modeling Motion with Multi-Modal Features for Text-Based Video Segmentation

Apr 06, 2022

One Student Knows All Experts Know: From Sparse to Dense

Jan 26, 2022

Large-Scale Deep Learning Optimizations: A Comprehensive Survey

Nov 02, 2021

Sparse-MLP: A Fully-MLP Architecture with Conditional Computation

Sep 08, 2021

Automated Audio Captioning using Transfer Learning and Reconstruction Latent Space Similarity Regularization

Aug 10, 2021

Go Wider Instead of Deeper

Jul 29, 2021

Recent Advances in Deep Learning Based Dialogue Systems: A Systematic Survey

Jun 01, 2021