Picture for Jiahui Yu

Jiahui Yu

Tony

CoCa: Contrastive Captioners are Image-Text Foundation Models

Add code
May 04, 2022
Figure 1 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 2 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 3 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Figure 4 for CoCa: Contrastive Captioners are Image-Text Foundation Models
Viaarxiv icon

Self-supervised Learning with Random-projection Quantizer for Speech Recognition

Add code
Feb 03, 2022
Figure 1 for Self-supervised Learning with Random-projection Quantizer for Speech Recognition
Figure 2 for Self-supervised Learning with Random-projection Quantizer for Speech Recognition
Figure 3 for Self-supervised Learning with Random-projection Quantizer for Speech Recognition
Figure 4 for Self-supervised Learning with Random-projection Quantizer for Speech Recognition
Viaarxiv icon

Co-training Transformer with Videos and Images Improves Action Recognition

Add code
Dec 14, 2021
Figure 1 for Co-training Transformer with Videos and Images Improves Action Recognition
Figure 2 for Co-training Transformer with Videos and Images Improves Action Recognition
Figure 3 for Co-training Transformer with Videos and Images Improves Action Recognition
Figure 4 for Co-training Transformer with Videos and Images Improves Action Recognition
Viaarxiv icon

Vector-quantized Image Modeling with Improved VQGAN

Add code
Oct 09, 2021
Figure 1 for Vector-quantized Image Modeling with Improved VQGAN
Figure 2 for Vector-quantized Image Modeling with Improved VQGAN
Figure 3 for Vector-quantized Image Modeling with Improved VQGAN
Figure 4 for Vector-quantized Image Modeling with Improved VQGAN
Viaarxiv icon

BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition

Add code
Oct 01, 2021
Figure 1 for BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Figure 2 for BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Figure 3 for BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Figure 4 for BigSSL: Exploring the Frontier of Large-Scale Semi-Supervised Learning for Automatic Speech Recognition
Viaarxiv icon

SimVLM: Simple Visual Language Model Pretraining with Weak Supervision

Add code
Aug 24, 2021
Figure 1 for SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Figure 2 for SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Figure 3 for SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Figure 4 for SimVLM: Simple Visual Language Model Pretraining with Weak Supervision
Viaarxiv icon

Normalization effects on shallow neural networks and related asymptotic expansions

Add code
Nov 20, 2020
Figure 1 for Normalization effects on shallow neural networks and related asymptotic expansions
Figure 2 for Normalization effects on shallow neural networks and related asymptotic expansions
Figure 3 for Normalization effects on shallow neural networks and related asymptotic expansions
Figure 4 for Normalization effects on shallow neural networks and related asymptotic expansions
Viaarxiv icon

Cascaded encoders for unifying streaming and non-streaming ASR

Add code
Oct 27, 2020
Figure 1 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 2 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 3 for Cascaded encoders for unifying streaming and non-streaming ASR
Figure 4 for Cascaded encoders for unifying streaming and non-streaming ASR
Viaarxiv icon

FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization

Add code
Oct 21, 2020
Figure 1 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 2 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 3 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Figure 4 for FastEmit: Low-latency Streaming ASR with Sequence-level Emission Regularization
Viaarxiv icon

Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling

Add code
Oct 12, 2020
Figure 1 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 2 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 3 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Figure 4 for Universal ASR: Unify and Improve Streaming ASR with Full-context Modeling
Viaarxiv icon