Tanja Schultz

End-to-end Acoustic-linguistic Emotion and Intent Recognition Enhanced by Semi-supervised Learning

Jul 10, 2025
Via arXiv

A Modular Pipeline for 3D Object Tracking Using RGB Cameras

Mar 06, 2025
Via arXiv

Deep Speech Synthesis from Multimodal Articulatory Representations

Dec 17, 2024
Via arXiv

Investigating Effective Speaker Property Privacy Protection in Federated Learning for Speech Emotion Recognition

Oct 17, 2024
Via arXiv

Speech as a Biomarker for Disease Detection

Sep 16, 2024
Via arXiv

NeuroSpex: Neuro-Guided Speaker Extraction with Cross-Modal Attention

Sep 04, 2024
Via arXiv

On the Role of Visual Grounding in VQA

Jun 26, 2024
Via arXiv

Speech Emotion Recognition under Resource Constraints with Data Distillation

Jun 21, 2024
Via arXiv

Diff-ETS: Learning a Diffusion Probabilistic Model for Electromyography-to-Speech Conversion

May 11, 2024
Via arXiv

STAA-Net: A Sparse and Transferable Adversarial Attack for Speech Emotion Recognition

Feb 02, 2024
Via arXiv