Alert button
Picture for Yan-Bo Lin

Yan-Bo Lin

Alert button

Siamese Vision Transformers are Scalable Audio-visual Learners

Add code
Bookmark button
Alert button
Mar 28, 2024
Yan-Bo Lin, Gedas Bertasius

Figure 1 for Siamese Vision Transformers are Scalable Audio-visual Learners
Figure 2 for Siamese Vision Transformers are Scalable Audio-visual Learners
Figure 3 for Siamese Vision Transformers are Scalable Audio-visual Learners
Figure 4 for Siamese Vision Transformers are Scalable Audio-visual Learners
Viaarxiv icon

DAM: Dynamic Adapter Merging for Continual Video QA Learning

Add code
Bookmark button
Alert button
Mar 13, 2024
Feng Cheng, Ziyang Wang, Yi-Lin Sung, Yan-Bo Lin, Mohit Bansal, Gedas Bertasius

Figure 1 for DAM: Dynamic Adapter Merging for Continual Video QA Learning
Figure 2 for DAM: Dynamic Adapter Merging for Continual Video QA Learning
Figure 3 for DAM: Dynamic Adapter Merging for Continual Video QA Learning
Figure 4 for DAM: Dynamic Adapter Merging for Continual Video QA Learning
Viaarxiv icon

Vision Transformers are Parameter-Efficient Audio-Visual Learners

Add code
Bookmark button
Alert button
Dec 15, 2022
Yan-Bo Lin, Yi-Lin Sung, Jie Lei, Mohit Bansal, Gedas Bertasius

Figure 1 for Vision Transformers are Parameter-Efficient Audio-Visual Learners
Figure 2 for Vision Transformers are Parameter-Efficient Audio-Visual Learners
Figure 3 for Vision Transformers are Parameter-Efficient Audio-Visual Learners
Figure 4 for Vision Transformers are Parameter-Efficient Audio-Visual Learners
Viaarxiv icon

ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound

Add code
Bookmark button
Alert button
Apr 06, 2022
Yan-Bo Lin, Jie Lei, Mohit Bansal, Gedas Bertasius

Figure 1 for ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
Figure 2 for ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
Figure 3 for ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
Figure 4 for ECLIPSE: Efficient Long-range Video Retrieval using Sight and Sound
Viaarxiv icon

Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation

Add code
Bookmark button
Alert button
May 03, 2021
Yan-Bo Lin, Yu-Chiang Frank Wang

Figure 1 for Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
Figure 2 for Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
Figure 3 for Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
Figure 4 for Exploiting Audio-Visual Consistency with Partial Supervision for Spatial Audio Generation
Viaarxiv icon

Unsupervised Sound Localization via Iterative Contrastive Learning

Add code
Bookmark button
Alert button
Apr 01, 2021
Yan-Bo Lin, Hung-Yu Tseng, Hsin-Ying Lee, Yen-Yu Lin, Ming-Hsuan Yang

Figure 1 for Unsupervised Sound Localization via Iterative Contrastive Learning
Figure 2 for Unsupervised Sound Localization via Iterative Contrastive Learning
Figure 3 for Unsupervised Sound Localization via Iterative Contrastive Learning
Figure 4 for Unsupervised Sound Localization via Iterative Contrastive Learning
Viaarxiv icon

Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation

Add code
Bookmark button
Alert button
Sep 20, 2019
Yu-Jhe Li, Ci-Siang Lin, Yan-Bo Lin, Yu-Chiang Frank Wang

Figure 1 for Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
Figure 2 for Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
Figure 3 for Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
Figure 4 for Cross-Dataset Person Re-Identification via Unsupervised Pose Disentanglement and Adaptation
Viaarxiv icon

Dual-modality seq2seq network for audio-visual event localization

Add code
Bookmark button
Alert button
Feb 20, 2019
Yan-Bo Lin, Yu-Jhe Li, Yu-Chiang Frank Wang

Figure 1 for Dual-modality seq2seq network for audio-visual event localization
Figure 2 for Dual-modality seq2seq network for audio-visual event localization
Figure 3 for Dual-modality seq2seq network for audio-visual event localization
Figure 4 for Dual-modality seq2seq network for audio-visual event localization
Viaarxiv icon