Picture for Jiahui Yu

Jiahui Yu

Tony

PaLM 2 Technical Report

Add code
May 17, 2023
Figure 1 for PaLM 2 Technical Report
Figure 2 for PaLM 2 Technical Report
Figure 3 for PaLM 2 Technical Report
Figure 4 for PaLM 2 Technical Report
Viaarxiv icon

Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR

Add code
Mar 31, 2023
Figure 1 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 2 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 3 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Figure 4 for Practical Conformer: Optimizing size, speed and flops of Conformer for on-Device and cloud ASR
Viaarxiv icon

VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining

Add code
Mar 24, 2023
Figure 1 for VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining
Figure 2 for VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining
Figure 3 for VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining
Figure 4 for VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining
Viaarxiv icon

CoBIT: A Contrastive Bi-directional Image-Text Generation Model

Add code
Mar 23, 2023
Figure 1 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 2 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 3 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Figure 4 for CoBIT: A Contrastive Bi-directional Image-Text Generation Model
Viaarxiv icon

Noise2Music: Text-conditioned Music Generation with Diffusion Models

Add code
Feb 08, 2023
Figure 1 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 2 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 3 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Figure 4 for Noise2Music: Text-conditioned Music Generation with Diffusion Models
Viaarxiv icon

Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners

Add code
Dec 09, 2022
Figure 1 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 2 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 3 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Figure 4 for Video-Text Modeling with Zero-Shot Transfer from Contrastive Captioners
Viaarxiv icon

Exploiting Category Names for Few-Shot Classification with Vision-Language Models

Add code
Dec 04, 2022
Viaarxiv icon

Deep object detection for waterbird monitoring using aerial imagery

Add code
Oct 10, 2022
Figure 1 for Deep object detection for waterbird monitoring using aerial imagery
Figure 2 for Deep object detection for waterbird monitoring using aerial imagery
Figure 3 for Deep object detection for waterbird monitoring using aerial imagery
Figure 4 for Deep object detection for waterbird monitoring using aerial imagery
Viaarxiv icon

Normalization effects on deep neural networks

Add code
Sep 02, 2022
Figure 1 for Normalization effects on deep neural networks
Figure 2 for Normalization effects on deep neural networks
Figure 3 for Normalization effects on deep neural networks
Figure 4 for Normalization effects on deep neural networks
Viaarxiv icon

Scaling Autoregressive Models for Content-Rich Text-to-Image Generation

Add code
Jun 22, 2022
Figure 1 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 2 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 3 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Figure 4 for Scaling Autoregressive Models for Content-Rich Text-to-Image Generation
Viaarxiv icon