Picture for Jianlong Fu

Jianlong Fu

MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text

Add code
Jul 31, 2023
Figure 1 for MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Figure 2 for MobileVidFactory: Automatic Diffusion-Based Social Media Video Generation for Mobile Devices from Text
Viaarxiv icon

SINC: Self-Supervised In-Context Learning for Vision-Language Tasks

Add code
Jul 15, 2023
Figure 1 for SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Figure 2 for SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Figure 3 for SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Figure 4 for SINC: Self-Supervised In-Context Learning for Vision-Language Tasks
Viaarxiv icon

Pave the Way to Grasp Anything: Transferring Foundation Models for Universal Pick-Place Robots

Add code
Jun 25, 2023
Viaarxiv icon

Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning

Add code
Jun 20, 2023
Figure 1 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 2 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 3 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Figure 4 for Learning Profitable NFT Image Diffusions via Multiple Visual-Policy Guided Reinforcement Learning
Viaarxiv icon

MovieFactory: Automatic Movie Creation from Text using Large Generative Models for Language and Images

Add code
Jun 12, 2023
Viaarxiv icon

AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation

Add code
May 30, 2023
Figure 1 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 2 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 3 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Figure 4 for AlphaBlock: Embodied Finetuning for Vision-Language Reasoning in Robot Manipulation
Viaarxiv icon

Solving Diffusion ODEs with Optimal Boundary Conditions for Better Image Super-Resolution

Add code
May 24, 2023
Viaarxiv icon

VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation

Add code
May 18, 2023
Figure 1 for VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Figure 2 for VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Figure 3 for VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Figure 4 for VideoFactory: Swap Attention in Spatiotemporal Diffusions for Text-to-Video Generation
Viaarxiv icon

NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation

Add code
Mar 22, 2023
Figure 1 for NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Figure 2 for NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Figure 3 for NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Figure 4 for NUWA-XL: Diffusion over Diffusion for eXtremely Long Video Generation
Viaarxiv icon

Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution

Add code
Mar 17, 2023
Figure 1 for Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Figure 2 for Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Figure 3 for Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Figure 4 for Learning Data-Driven Vector-Quantized Degradation Model for Animation Video Super-Resolution
Viaarxiv icon