Mingshuang Luo

Morph: A Motion-free Physics Optimization Framework for Human Motion Generation

Nov 22, 2024

M$^3$GPT: An Advanced Multimodal, Multitask Framework for Motion Comprehension and Generation

May 29, 2024

Predicting Multi-Codebook Vector Quantization Indexes for Knowledge Distillation

Oct 31, 2022

Fast and parallel decoding for transducer

Oct 31, 2022

Pruned RNN-T for fast, memory-efficient ASR training

Jun 23, 2022

Synchronous Bidirectional Learning for Multilingual Lip Reading

May 12, 2020

Pseudo-Convolutional Policy Gradient for Sequence-to-Sequence Lip-Reading

Mar 09, 2020