Picture for Long Mai

Long Mai

CineVerse: Consistent Keyframe Synthesis for Cinematic Scene Composition

Add code
Apr 28, 2025
Viaarxiv icon

REGEN: Learning Compact Video Embedding with (Re-)Generative Decoder

Add code
Mar 11, 2025
Viaarxiv icon

MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation

Add code
Feb 06, 2025
Figure 1 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 2 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 3 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Figure 4 for MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
Viaarxiv icon

Pushing the Boundaries of State Space Models for Image and Video Generation

Add code
Feb 03, 2025
Figure 1 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 2 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 3 for Pushing the Boundaries of State Space Models for Image and Video Generation
Figure 4 for Pushing the Boundaries of State Space Models for Image and Video Generation
Viaarxiv icon

Progressive Growing of Video Tokenizers for Highly Compressed Latent Spaces

Add code
Jan 09, 2025
Viaarxiv icon

GaussianVideo: Efficient Video Representation via Hierarchical Gaussian Splatting

Add code
Jan 08, 2025
Viaarxiv icon

Real-Time Textless Dialogue Generation

Add code
Jan 08, 2025
Viaarxiv icon

TAB: Transformer Attention Bottlenecks enable User Intervention and Debugging in Vision-Language Models

Add code
Dec 24, 2024
Viaarxiv icon

Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning

Add code
Dec 04, 2024
Figure 1 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 2 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 3 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Figure 4 for Improving Linguistic Diversity of Large Language Models with Possibility Exploration Fine-Tuning
Viaarxiv icon

Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations

Add code
May 31, 2024
Figure 1 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 2 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 3 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Figure 4 for Empowering Visual Creativity: A Vision-Language Assistant to Image Editing Recommendations
Viaarxiv icon