Zero Shot Text To Video Generation


VidVec: Unlocking Video MLLM Embeddings for Video-Text Retrieval

Add code
Feb 08, 2026
Viaarxiv icon

PyraTok: Language-Aligned Pyramidal Tokenizer for Video Understanding and Generation

Add code
Jan 22, 2026
Viaarxiv icon

PREGEN: Uncovering Latent Thoughts in Composed Video Retrieval

Add code
Jan 20, 2026
Viaarxiv icon

VIRTUE: Versatile Video Retrieval Through Unified Embeddings

Add code
Jan 17, 2026
Viaarxiv icon

Goal Force: Teaching Video Models To Accomplish Physics-Conditioned Goals

Add code
Jan 09, 2026
Viaarxiv icon

MM-Sonate: Multimodal Controllable Audio-Video Generation with Zero-Shot Voice Cloning

Add code
Jan 08, 2026
Viaarxiv icon

AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation

Add code
Dec 16, 2025
Figure 1 for AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Figure 2 for AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Figure 3 for AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Figure 4 for AnchorHOI: Zero-shot Generation of 4D Human-Object Interaction via Anchor-based Prior Distillation
Viaarxiv icon

MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers

Add code
Nov 06, 2025
Figure 1 for MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers
Figure 2 for MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers
Figure 3 for MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers
Figure 4 for MusRec: Zero-Shot Text-to-Music Editing via Rectified Flow and Diffusion Transformers
Viaarxiv icon

EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer

Add code
Sep 26, 2025
Figure 1 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 2 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 3 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Figure 4 for EMMA: Generalizing Real-World Robot Manipulation via Generative Visual Transfer
Viaarxiv icon

StyleSculptor: Zero-Shot Style-Controllable 3D Asset Generation with Texture-Geometry Dual Guidance

Add code
Sep 16, 2025
Viaarxiv icon