Picture for Xiangyu Zhang

Xiangyu Zhang

Binaural Selective Attention Model for Target Speaker Extraction

Add code
Jun 18, 2024
Figure 1 for Binaural Selective Attention Model for Target Speaker Extraction
Figure 2 for Binaural Selective Attention Model for Target Speaker Extraction
Figure 3 for Binaural Selective Attention Model for Target Speaker Extraction
Figure 4 for Binaural Selective Attention Model for Target Speaker Extraction
Viaarxiv icon

Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks

Add code
Jun 09, 2024
Figure 1 for Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Figure 2 for Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Figure 3 for Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Figure 4 for Self-supervised Adversarial Training of Monocular Depth Estimation against Physical-World Attacks
Viaarxiv icon

Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders

Add code
Jun 05, 2024
Figure 1 for Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
Figure 2 for Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
Figure 3 for Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
Figure 4 for Mutual Information Guided Backdoor Mitigation for Pre-trained Encoders
Viaarxiv icon

Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases

Add code
May 30, 2024
Figure 1 for Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases
Figure 2 for Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases
Figure 3 for Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases
Figure 4 for Source Code Foundation Models are Transferable Binary Analysis Knowledge Bases
Viaarxiv icon

Is a 3D-Tokenized LLM the Key to Reliable Autonomous Driving?

Add code
May 28, 2024
Viaarxiv icon

Reflected Flow Matching

Add code
May 26, 2024
Viaarxiv icon

Focus Anywhere for Fine-grained Multi-page Document Understanding

Add code
May 23, 2024
Figure 1 for Focus Anywhere for Fine-grained Multi-page Document Understanding
Figure 2 for Focus Anywhere for Fine-grained Multi-page Document Understanding
Figure 3 for Focus Anywhere for Fine-grained Multi-page Document Understanding
Figure 4 for Focus Anywhere for Fine-grained Multi-page Document Understanding
Viaarxiv icon

Mamba in Speech: Towards an Alternative to Self-Attention

Add code
May 22, 2024
Figure 1 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 2 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 3 for Mamba in Speech: Towards an Alternative to Self-Attention
Figure 4 for Mamba in Speech: Towards an Alternative to Self-Attention
Viaarxiv icon

On the relevance of pre-neural approaches in natural language processing pedagogy

Add code
May 16, 2024
Figure 1 for On the relevance of pre-neural approaches in natural language processing pedagogy
Figure 2 for On the relevance of pre-neural approaches in natural language processing pedagogy
Figure 3 for On the relevance of pre-neural approaches in natural language processing pedagogy
Figure 4 for On the relevance of pre-neural approaches in natural language processing pedagogy
Viaarxiv icon

Music Emotion Prediction Using Recurrent Neural Networks

Add code
May 10, 2024
Figure 1 for Music Emotion Prediction Using Recurrent Neural Networks
Figure 2 for Music Emotion Prediction Using Recurrent Neural Networks
Figure 3 for Music Emotion Prediction Using Recurrent Neural Networks
Figure 4 for Music Emotion Prediction Using Recurrent Neural Networks
Viaarxiv icon