Andrew Tao

X-VILA: Cross-Modality Alignment for Large Language Model

May 29, 2024
Via arXiv

VILA: On Pre-training for Visual Language Models

Dec 14, 2023
Via arXiv

FasterViT: Fast Vision Transformers with Hierarchical Attention

Jun 09, 2023
Via arXiv

Progressive Learning of 3D Reconstruction Network from 2D GAN Data

May 18, 2023
Via arXiv

Preserve Your Own Correlation: A Noise Prior for Video Diffusion Models

May 17, 2023
Via arXiv

Fine Detailed Texture Learning for 3D Meshes with Generative Models

Mar 17, 2022
Via arXiv

Leveraging Bitstream Metadata for Fast and Accurate Video Compression Correction

Jan 31, 2022
Via arXiv

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

Nov 24, 2021
Via arXiv

View Generalization for Single Image Textured 3D Models

Jun 10, 2021
Via arXiv

Dual Contrastive Loss and Attention for GANs

Mar 31, 2021
Via arXiv