Dan Kondratyuk

CamViG: Camera Aware Image-to-Video Generation with Multimodal Transformers

May 21, 2024
Via arXiv

VideoPoet: A Large Language Model for Zero-Shot Video Generation

Dec 21, 2023
Via arXiv

Alternating Gradient Descent and Mixture-of-Experts for Integrated Multimodal Perception

May 10, 2023
Via arXiv

Towards a Unified Foundation Model: Jointly Pre-Training Transformers on Unpaired Images and Text

Dec 14, 2021
Via arXiv

MoViNets: Mobile Video Networks for Efficient Video Recognition

Apr 18, 2021
Via arXiv

Multiple Networks are More Efficient than One: Fast and Accurate Models via Ensembles and Cascades

Dec 03, 2020
Via arXiv

When Ensembling Smaller Models is More Efficient than Single Large Models

May 01, 2020
Via arXiv