Picture for Zachary Novack

Zachary Novack

SoundReactor: Frame-level Online Video-to-Audio Generation

Add code
Oct 02, 2025
Viaarxiv icon

WildScore: Benchmarking MLLMs in-the-Wild Symbolic Music Reasoning

Add code
Sep 05, 2025
Viaarxiv icon

Bob's Confetti: Phonetic Memorization Attacks in Music and Video Generation

Add code
Jul 23, 2025
Viaarxiv icon

Video-Guided Text-to-Music Generation Using Public Domain Movie Collections

Add code
Jun 14, 2025
Viaarxiv icon

Repurposing Marigold for Zero-Shot Metric Depth Estimation via Defocus Blur Cues

Add code
May 23, 2025
Viaarxiv icon

Fast Text-to-Audio Generation with Adversarial Post-Training

Add code
May 14, 2025
Viaarxiv icon

Aligning Text-to-Music Evaluation with Human Preferences

Add code
Mar 20, 2025
Viaarxiv icon

Deriving Representative Structure from Music Corpora

Add code
Feb 21, 2025
Viaarxiv icon

Presto! Distilling Steps and Layers for Accelerating Music Generation

Add code
Oct 07, 2024
Figure 1 for Presto! Distilling Steps and Layers for Accelerating Music Generation
Figure 2 for Presto! Distilling Steps and Layers for Accelerating Music Generation
Figure 3 for Presto! Distilling Steps and Layers for Accelerating Music Generation
Figure 4 for Presto! Distilling Steps and Layers for Accelerating Music Generation
Viaarxiv icon

CoLLAP: Contrastive Long-form Language-Audio Pretraining with Musical Temporal Structure Augmentation

Add code
Oct 03, 2024
Viaarxiv icon