Picture for Zhuowen Tu

Zhuowen Tu

BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions

Add code
Aug 19, 2023
Figure 1 for BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Figure 2 for BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Figure 3 for BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Figure 4 for BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
Viaarxiv icon

Patched Denoising Diffusion Models For High-Resolution Image Synthesis

Add code
Aug 02, 2023
Figure 1 for Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Figure 2 for Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Figure 3 for Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Figure 4 for Patched Denoising Diffusion Models For High-Resolution Image Synthesis
Viaarxiv icon

Distilling Large Vision-Language Model with Out-of-Distribution Generalizability

Add code
Jul 19, 2023
Viaarxiv icon

DocTr: Document Transformer for Structured Information Extraction in Documents

Add code
Jul 16, 2023
Viaarxiv icon

Musketeer (All for One, and One for All): A Generalist Vision-Language Model with Task Explanation Prompts

Add code
May 11, 2023
Viaarxiv icon

Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction

Add code
Apr 17, 2023
Viaarxiv icon

DiffusionRig: Learning Personalized Priors for Facial Appearance Editing

Add code
Apr 13, 2023
Figure 1 for DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Figure 2 for DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Figure 3 for DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Figure 4 for DiffusionRig: Learning Personalized Priors for Facial Appearance Editing
Viaarxiv icon

On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning

Add code
Oct 19, 2022
Figure 1 for On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Figure 2 for On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Figure 3 for On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Figure 4 for On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
Viaarxiv icon

Point Cloud Recognition with Position-to-Structure Attention Transformers

Add code
Oct 05, 2022
Figure 1 for Point Cloud Recognition with Position-to-Structure Attention Transformers
Figure 2 for Point Cloud Recognition with Position-to-Structure Attention Transformers
Figure 3 for Point Cloud Recognition with Position-to-Structure Attention Transformers
Figure 4 for Point Cloud Recognition with Position-to-Structure Attention Transformers
Viaarxiv icon

An In-depth Study of Stochastic Backpropagation

Add code
Sep 30, 2022
Figure 1 for An In-depth Study of Stochastic Backpropagation
Figure 2 for An In-depth Study of Stochastic Backpropagation
Figure 3 for An In-depth Study of Stochastic Backpropagation
Figure 4 for An In-depth Study of Stochastic Backpropagation
Viaarxiv icon