Justin Salamon

Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries

Aug 17, 2023
Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto

Language-Guided Music Recommendation for Video via Prompt Analogies

Jun 15, 2023
Daniel McKee, Justin Salamon, Josef Sivic, Bryan Russell

Efficient Spoken Language Recognition via Multilabel Classification

Jun 02, 2023
Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon

Conditional Generation of Audio from Video via Foley Analogies

Apr 17, 2023
Yuexi Du, Ziyang Chen, Justin Salamon, Bryan Russell, Andrew Owens

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Mar 28, 2023
Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko

Audio-Text Models Do Not Yet Leverage Natural Language

Mar 19, 2023
Ho-Hsiang Wu, Oriol Nieto, Juan Pablo Bello, Justin Salamon

It's Time for Artistic Correspondence in Music and Video

Jun 14, 2022
Didac Suris, Carl Vondrick, Bryan Russell, Justin Salamon

Filler Word Detection and Classification: A Dataset and Benchmark

Mar 28, 2022
Ge Zhu, Juan-Pablo Caceres, Justin Salamon

HEAR 2021: Holistic Evaluation of Audio Representations

Mar 26, 2022
Joseph Turian, Jordie Shier, Humair Raj Khan, Bhiksha Raj, Björn W. Schuller, Christian J. Steinmetz, Colin Malloy, George Tzanetakis, Gissel Velarde, Kirk McNally, Max Henry, Nicolas Pinto, Camille Noufi, Christian Clough, Dorien Herremans, Eduardo Fonseca, Jesse Engel, Justin Salamon, Philippe Esling, Pranay Manocha, Shinji Watanabe, Zeyu Jin, Yonatan Bisk
