Picture for Biao Gong

Biao Gong

Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion

Add code
Jun 08, 2025
Viaarxiv icon

Dimension-Reduction Attack! Video Generative Models are Experts on Controllable Image Synthesis

Add code
May 29, 2025
Viaarxiv icon

Ming-Lite-Uni: Advancements in Unified Architecture for Natural Multimodal Interaction

Add code
May 05, 2025
Viaarxiv icon

DreamRelation: Relation-Centric Video Customization

Add code
Mar 10, 2025
Viaarxiv icon

Benchmarking Large Vision-Language Models via Directed Scene Graph for Comprehensive Image Captioning

Add code
Dec 12, 2024
Viaarxiv icon

MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation

Add code
Dec 08, 2024
Figure 1 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 2 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 3 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Figure 4 for MotionStone: Decoupled Motion Intensity Modulation with Diffusion Transformer for Image-to-Video Generation
Viaarxiv icon

Mimir: Improving Video Diffusion Models for Precise Text Understanding

Add code
Dec 04, 2024
Figure 1 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 2 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 3 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Figure 4 for Mimir: Improving Video Diffusion Models for Precise Text Understanding
Viaarxiv icon

MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution

Add code
Nov 26, 2024
Figure 1 for MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
Figure 2 for MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
Figure 3 for MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
Figure 4 for MAT: Multi-Range Attention Transformer for Efficient Image Super-Resolution
Viaarxiv icon

LumiSculpt: A Consistency Lighting Control Network for Video Generation

Add code
Oct 30, 2024
Figure 1 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 2 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 3 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Figure 4 for LumiSculpt: A Consistency Lighting Control Network for Video Generation
Viaarxiv icon

Framer: Interactive Frame Interpolation

Add code
Oct 24, 2024
Figure 1 for Framer: Interactive Frame Interpolation
Figure 2 for Framer: Interactive Frame Interpolation
Figure 3 for Framer: Interactive Frame Interpolation
Figure 4 for Framer: Interactive Frame Interpolation
Viaarxiv icon