Tao Lei

Simple Recurrence Improves Masked Language Models

May 23, 2022
Via arXiv

Mixture-of-Experts with Expert Choice Routing

Feb 18, 2022
Via arXiv

SRU++: Pioneering Fast Recurrence with Attention for Speech Recognition

Oct 11, 2021
Via arXiv

Channel-Temporal Attention for First-Person Video Domain Adaptation

Aug 19, 2021
Via arXiv

Team PyKale Submission to the EPIC-Kitchens 2021 Unsupervised Domain Adaptation Challenge for Action Recognition

Jun 22, 2021
Via arXiv

Nutribullets Hybrid: Multi-document Health Summarization

Apr 08, 2021
Via arXiv

Nutri-bullets: Summarizing Health Studies by Composing Segments

Mar 22, 2021
Via arXiv

When Attention Meets Fast Recurrence: Training Language Models with Reduced Compute

Feb 24, 2021
Via arXiv

Medical Image Segmentation Using Deep Learning: A Survey

Sep 28, 2020
Via arXiv

Autoregressive Knowledge Distillation through Imitation Learning

Sep 15, 2020
Via arXiv