Arda Senocak

From Coarse to Fine: Efficient Training for Audio Spectrogram Transformers

Jan 16, 2024
Jiu Feng, Mehmet Hamza Erol, Joon Son Chung, Arda Senocak

Can CLIP Help Sound Source Localization?

Nov 07, 2023
Sooyoung Park, Arda Senocak, Joon Son Chung

Sound Source Localization is All about Cross-Modal Alignment

Sep 19, 2023
Arda Senocak, Hyeonggon Ryu, Junsik Kim, Tae-Hyun Oh, Hanspeter Pfister, Joon Son Chung

FlexiAST: Flexibility is What AST Needs

Jul 18, 2023
Jiu Feng, Mehmet Hamza Erol, Joon Son Chung, Arda Senocak

Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Mar 30, 2023
Hyeonggon Ryu, Arda Senocak, In So Kweon, Joon Son Chung

Sound to Visual Scene Generation by Audio-to-Visual Latent Alignment

Mar 30, 2023
Kim Sung-Bin, Arda Senocak, Hyunwoo Ha, Andrew Owens, Tae-Hyun Oh

MarginNCE: Robust Sound Localization with a Negative Margin

Nov 03, 2022
Sooyoung Park, Arda Senocak, Joon Son Chung

Audio-Visual Fusion Layers for Event Type Aware Video Recognition

Feb 12, 2022
Arda Senocak, Junsik Kim, Tae-Hyun Oh, Hyeonggon Ryu, Dingzeyu Li, In So Kweon

Learning Sound Localization Better From Semantically Similar Samples

Feb 07, 2022
Arda Senocak, Hyeonggon Ryu, Junsik Kim, In So Kweon

Learning to Localize Sound Sources in Visual Scenes: Analysis and Applications

Nov 20, 2019
Arda Senocak, Tae-Hyun Oh, Junsik Kim, Ming-Hsuan Yang, In So Kweon
