Alert button
Picture for Oriol Nieto

Oriol Nieto

Alert button

CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models

Add code
Bookmark button
Alert button
Oct 12, 2023
Sreyan Ghosh, Ashish Seth, Sonal Kumar, Utkarsh Tyagi, Chandra Kiran Evuru, S. Ramaneswaran, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha

Figure 1 for CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Figure 2 for CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Figure 3 for CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Figure 4 for CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models
Viaarxiv icon

Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries

Add code
Bookmark button
Alert button
Aug 17, 2023
Julia Wilkins, Justin Salamon, Magdalena Fuentes, Juan Pablo Bello, Oriol Nieto

Figure 1 for Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
Figure 2 for Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
Figure 3 for Bridging High-Quality Audio and Video via Language for Sound Effects Retrieval from Visual Queries
Viaarxiv icon

Efficient Spoken Language Recognition via Multilabel Classification

Add code
Bookmark button
Alert button
Jun 02, 2023
Oriol Nieto, Zeyu Jin, Franck Dernoncourt, Justin Salamon

Figure 1 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 2 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 3 for Efficient Spoken Language Recognition via Multilabel Classification
Figure 4 for Efficient Spoken Language Recognition via Multilabel Classification
Viaarxiv icon

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Add code
Bookmark button
Alert button
Mar 28, 2023
Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko

Figure 1 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 2 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 3 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 4 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Viaarxiv icon

Audio-Text Models Do Not Yet Leverage Natural Language

Add code
Bookmark button
Alert button
Mar 19, 2023
Ho-Hsiang Wu, Oriol Nieto, Juan Pablo Bello, Justin Salamon

Figure 1 for Audio-Text Models Do Not Yet Leverage Natural Language
Figure 2 for Audio-Text Models Do Not Yet Leverage Natural Language
Figure 3 for Audio-Text Models Do Not Yet Leverage Natural Language
Figure 4 for Audio-Text Models Do Not Yet Leverage Natural Language
Viaarxiv icon

Music Enhancement via Image Translation and Vocoding

Add code
Bookmark button
Alert button
Apr 28, 2022
Nikhil Kandpal, Oriol Nieto, Zeyu Jin

Figure 1 for Music Enhancement via Image Translation and Vocoding
Figure 2 for Music Enhancement via Image Translation and Vocoding
Figure 3 for Music Enhancement via Image Translation and Vocoding
Figure 4 for Music Enhancement via Image Translation and Vocoding
Viaarxiv icon

Mood Classification Using Listening Data

Add code
Bookmark button
Alert button
Oct 22, 2020
Filip Korzeniowski, Oriol Nieto, Matthew McCallum, Minz Won, Sergio Oramas, Erik Schmidt

Figure 1 for Mood Classification Using Listening Data
Figure 2 for Mood Classification Using Listening Data
Figure 3 for Mood Classification Using Listening Data
Figure 4 for Mood Classification Using Listening Data
Viaarxiv icon

End-to-end learning for music audio tagging at scale

Add code
Bookmark button
Alert button
Jun 15, 2018
Jordi Pons, Oriol Nieto, Matthew Prockup, Erik Schmidt, Andreas Ehmann, Xavier Serra

Figure 1 for End-to-end learning for music audio tagging at scale
Figure 2 for End-to-end learning for music audio tagging at scale
Figure 3 for End-to-end learning for music audio tagging at scale
Figure 4 for End-to-end learning for music audio tagging at scale
Viaarxiv icon

Predicting Audio Advertisement Quality

Add code
Bookmark button
Alert button
Feb 09, 2018
Samaneh Ebrahimi, Hossein Vahabi, Matthew Prockup, Oriol Nieto

Figure 1 for Predicting Audio Advertisement Quality
Figure 2 for Predicting Audio Advertisement Quality
Figure 3 for Predicting Audio Advertisement Quality
Figure 4 for Predicting Audio Advertisement Quality
Viaarxiv icon

A Deep Multimodal Approach for Cold-start Music Recommendation

Add code
Bookmark button
Alert button
Jul 24, 2017
Sergio Oramas, Oriol Nieto, Mohamed Sordo, Xavier Serra

Figure 1 for A Deep Multimodal Approach for Cold-start Music Recommendation
Figure 2 for A Deep Multimodal Approach for Cold-start Music Recommendation
Figure 3 for A Deep Multimodal Approach for Cold-start Music Recommendation
Viaarxiv icon