Junwen Xiong

DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction

Mar 02, 2024
Junwen Xiong, Peng Zhang, Tao You, Chuanyue Li, Wei Huang, Yufei Zha

Via arXiv

UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection

Sep 15, 2023
Junwen Xiong, Peng Zhang, Chuanyue Li, Wei Huang, Yufei Zha, Tao You

Via arXiv

FTFDNet: Learning to Detect Talking Face Video Manipulation with Tri-Modality Interaction

Jul 08, 2023
Ganglai Wang, Peng Zhang, Junwen Xiong, Feihan Yang, Wei Huang, Yufei Zha

Via arXiv

CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective

Mar 11, 2023
Junwen Xiong, Ganglai Wang, Peng Zhang, Wei Huang, Yufei Zha, Guangtao Zhai

Via arXiv

Audio-visual speech separation based on joint feature representation with cross-modal attention

Mar 05, 2022
Junwen Xiong, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha, Yanning Zhang

Via arXiv

Look&Listen: Multi-Modal Correlation Learning for Active Speaker Detection and Speech Enhancement

Mar 04, 2022
Junwen Xiong, Yu Zhou, Peng Zhang, Lei Xie, Wei Huang, Yufei Zha

Via arXiv