Alert button
Picture for Hyeonggon Ryu

Hyeonggon Ryu

Alert button

Sound Source Localization is All about Cross-Modal Alignment

Add code
Bookmark button
Alert button
Sep 19, 2023
Arda Senocak, Hyeonggon Ryu, Junsik Kim, Tae-Hyun Oh, Hanspeter Pfister, Joon Son Chung

Figure 1 for Sound Source Localization is All about Cross-Modal Alignment
Figure 2 for Sound Source Localization is All about Cross-Modal Alignment
Figure 3 for Sound Source Localization is All about Cross-Modal Alignment
Figure 4 for Sound Source Localization is All about Cross-Modal Alignment
Viaarxiv icon

Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples

Add code
Bookmark button
Alert button
Mar 30, 2023
Hyeonggon Ryu, Arda Senocak, In So Kweon, Joon Son Chung

Figure 1 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
Figure 2 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
Figure 3 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
Figure 4 for Hindi as a Second Language: Improving Visually Grounded Speech with Semantically Similar Samples
Viaarxiv icon

Generative Bias for Visual Question Answering

Add code
Bookmark button
Alert button
Aug 02, 2022
Jae Won Cho, Dong-jin Kim, Hyeonggon Ryu, In So Kweon

Figure 1 for Generative Bias for Visual Question Answering
Figure 2 for Generative Bias for Visual Question Answering
Figure 3 for Generative Bias for Visual Question Answering
Figure 4 for Generative Bias for Visual Question Answering
Viaarxiv icon

Audio-Visual Fusion Layers for Event Type Aware Video Recognition

Add code
Bookmark button
Alert button
Feb 12, 2022
Arda Senocak, Junsik Kim, Tae-Hyun Oh, Hyeonggon Ryu, Dingzeyu Li, In So Kweon

Figure 1 for Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Figure 2 for Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Figure 3 for Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Figure 4 for Audio-Visual Fusion Layers for Event Type Aware Video Recognition
Viaarxiv icon

Learning Sound Localization Better From Semantically Similar Samples

Add code
Bookmark button
Alert button
Feb 07, 2022
Arda Senocak, Hyeonggon Ryu, Junsik Kim, In So Kweon

Figure 1 for Learning Sound Localization Better From Semantically Similar Samples
Figure 2 for Learning Sound Localization Better From Semantically Similar Samples
Figure 3 for Learning Sound Localization Better From Semantically Similar Samples
Figure 4 for Learning Sound Localization Better From Semantically Similar Samples
Viaarxiv icon