Picture for Weifeng Chen

Weifeng Chen

TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration

Add code
Mar 04, 2026
Viaarxiv icon

Train Short, Inference Long: Training-free Horizon Extension for Autoregressive Video Generation

Add code
Feb 17, 2026
Viaarxiv icon

LLM4Fluid: Large Language Models as Generalizable Neural Solvers for Fluid Dynamics

Add code
Jan 29, 2026
Viaarxiv icon

FlowAct-R1: Towards Interactive Humanoid Video Generation

Add code
Jan 15, 2026
Viaarxiv icon

PhyGDPO: Physics-Aware Groupwise Direct Preference Optimization for Physically Consistent Text-to-Video Generation

Add code
Dec 31, 2025
Viaarxiv icon

Exploring MLLM-Diffusion Information Transfer with MetaCanvas

Add code
Dec 12, 2025
Viaarxiv icon

OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization

Add code
Dec 19, 2024
Figure 1 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 2 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 3 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Figure 4 for OnlineVPO: Align Video Diffusion Model with Online Video-Centric Preference Optimization
Viaarxiv icon

Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM

Add code
Dec 19, 2024
Figure 1 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 2 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 3 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Figure 4 for Prompt-A-Video: Prompt Your Video Diffusion Model via Preference-Aligned LLM
Viaarxiv icon

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Add code
Jul 10, 2024
Figure 1 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 2 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 3 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Figure 4 for IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model
Viaarxiv icon

ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning

Add code
Apr 23, 2024
Figure 1 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 2 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 3 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Figure 4 for ID-Aligner: Enhancing Identity-Preserving Text-to-Image Generation with Reward Feedback Learning
Viaarxiv icon