Picture for Sharon Gannot

Sharon Gannot

Audio-Visual Approach For Multimodal Concurrent Speaker Detection

Add code
Jul 01, 2024
Figure 1 for Audio-Visual Approach For Multimodal Concurrent Speaker Detection
Figure 2 for Audio-Visual Approach For Multimodal Concurrent Speaker Detection
Figure 3 for Audio-Visual Approach For Multimodal Concurrent Speaker Detection
Figure 4 for Audio-Visual Approach For Multimodal Concurrent Speaker Detection
Viaarxiv icon

peerRTF: Robust MVDR Beamforming Using Graph Convolutional Network

Add code
Jul 01, 2024
Viaarxiv icon

Multi-Microphone Speech Emotion Recognition using the Hierarchical Token-semantic Audio Transformer Architecture

Add code
Jun 05, 2024
Viaarxiv icon

RevRIR: Joint Reverberant Speech and Room Impulse Response Embedding using Contrastive Learning with Application to Room Shape Classification

Add code
Jun 05, 2024
Viaarxiv icon

SingIt! Singer Voice Transformation

Add code
May 07, 2024
Viaarxiv icon

Socially Pertinent Robots in Gerontological Healthcare

Add code
Apr 11, 2024
Figure 1 for Socially Pertinent Robots in Gerontological Healthcare
Figure 2 for Socially Pertinent Robots in Gerontological Healthcare
Figure 3 for Socially Pertinent Robots in Gerontological Healthcare
Figure 4 for Socially Pertinent Robots in Gerontological Healthcare
Viaarxiv icon

Concurrent Speaker Detection: A multi-microphone Transformer-Based Approach

Add code
Mar 11, 2024
Figure 1 for Concurrent Speaker Detection: A multi-microphone Transformer-Based Approach
Figure 2 for Concurrent Speaker Detection: A multi-microphone Transformer-Based Approach
Figure 3 for Concurrent Speaker Detection: A multi-microphone Transformer-Based Approach
Figure 4 for Concurrent Speaker Detection: A multi-microphone Transformer-Based Approach
Viaarxiv icon

Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers

Add code
Jan 15, 2024
Figure 1 for Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers
Figure 2 for Comparison of Frequency-Fusion Mechanisms for Binaural Direction-of-Arrival Estimation for Multiple Speakers
Viaarxiv icon

Single-Microphone Speaker Separation and Voice Activity Detection in Noisy and Reverberant Environments

Add code
Jan 07, 2024
Viaarxiv icon

LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading

Add code
Jun 05, 2023
Figure 1 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 2 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 3 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Figure 4 for LipVoicer: Generating Speech from Silent Videos Guided by Lip Reading
Viaarxiv icon