Picture for Bingyue Peng

Bingyue Peng

Generative Refinement Networks for Visual Synthesis

Add code
Apr 14, 2026
Viaarxiv icon

Long-Horizon Streaming Video Generation via Hybrid Attention with Decoupled Distillation

Add code
Apr 11, 2026
Viaarxiv icon

ALIVE: Animate Your World with Lifelike Audio-Video Generation

Add code
Feb 09, 2026
Viaarxiv icon

InfinityStar: Unified Spacetime AutoRegressive Modeling for Visual Generation

Add code
Nov 06, 2025
Viaarxiv icon

HLLM-Creator: Hierarchical LLM-based Personalized Creative Generation

Add code
Aug 25, 2025
Viaarxiv icon

VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs

Add code
Apr 08, 2025
Figure 1 for VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs
Figure 2 for VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs
Figure 3 for VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs
Figure 4 for VC-LLM: Automated Advertisement Video Creation from Raw Footage using Multi-modal LLMs
Viaarxiv icon

UniTok: A Unified Tokenizer for Visual Generation and Understanding

Add code
Feb 27, 2025
Figure 1 for UniTok: A Unified Tokenizer for Visual Generation and Understanding
Figure 2 for UniTok: A Unified Tokenizer for Visual Generation and Understanding
Figure 3 for UniTok: A Unified Tokenizer for Visual Generation and Understanding
Figure 4 for UniTok: A Unified Tokenizer for Visual Generation and Understanding
Viaarxiv icon

HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation

Add code
Feb 10, 2025
Figure 1 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 2 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 3 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Figure 4 for HumanDiT: Pose-Guided Diffusion Transformer for Long-form Human Motion Video Generation
Viaarxiv icon

Goku: Flow Based Video Generative Foundation Models

Add code
Feb 10, 2025
Figure 1 for Goku: Flow Based Video Generative Foundation Models
Figure 2 for Goku: Flow Based Video Generative Foundation Models
Figure 3 for Goku: Flow Based Video Generative Foundation Models
Figure 4 for Goku: Flow Based Video Generative Foundation Models
Viaarxiv icon

Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs

Add code
Jan 10, 2025
Figure 1 for Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs
Figure 2 for Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs
Figure 3 for Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs
Figure 4 for Text-to-Edit: Controllable End-to-End Video Ad Creation via Multimodal LLMs
Viaarxiv icon