Junjie Li

Audio-Visual Target Speaker Extraction with Reverse Selective Auditory Attention

Apr 29, 2024
Via arXiv

IPAD: Industrial Process Anomaly Detection Dataset

Apr 23, 2024
Via arXiv

Rethinking Clothes Changing Person ReID: Conflicts, Synthesis, and Optimization

Apr 19, 2024
Via arXiv

Conifer: Improving Complex Constrained Instruction-Following Ability of Large Language Models

Apr 03, 2024
Via arXiv

EL-VIT: Probing Vision Transformer with Interactive Visualization

Jan 23, 2024
Via arXiv

Amplifying robotics capacities with a human touch: An immersive low-latency panoramic remote system

Jan 09, 2024
Via arXiv

Revisiting Foreground and Background Separation in Weakly-supervised Temporal Action Localization: A Clustering-based Approach

Dec 21, 2023
Via arXiv

SEF-VC: Speaker Embedding Free Zero-Shot Voice Conversion with Cross Attention

Dec 14, 2023
Via arXiv

GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection

Nov 16, 2023
Via arXiv

Generalizable Person Search on Open-world User-Generated Video Content

Oct 16, 2023
Via arXiv