Picture for Stavros Petridis

Stavros Petridis

Hearing Loss Detection from Facial Expressions in One-on-one Conversations

Add code
Jan 17, 2024
Figure 1 for Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Figure 2 for Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Figure 3 for Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Figure 4 for Hearing Loss Detection from Facial Expressions in One-on-one Conversations
Viaarxiv icon

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

Add code
Jul 10, 2023
Figure 1 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 2 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 3 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 4 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Viaarxiv icon

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

Add code
May 15, 2023
Figure 1 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 2 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 3 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 4 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Viaarxiv icon

Is dataset condensation a silver bullet for healthcare data sharing?

Add code
May 05, 2023
Viaarxiv icon

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Add code
Apr 03, 2023
Figure 1 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 2 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 3 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 4 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Viaarxiv icon

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Add code
Mar 25, 2023
Figure 1 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 2 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 3 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 4 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Viaarxiv icon

Learning Cross-lingual Visual Speech Representations

Add code
Mar 14, 2023
Figure 1 for Learning Cross-lingual Visual Speech Representations
Figure 2 for Learning Cross-lingual Visual Speech Representations
Figure 3 for Learning Cross-lingual Visual Speech Representations
Figure 4 for Learning Cross-lingual Visual Speech Representations
Viaarxiv icon

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Add code
Jan 06, 2023
Figure 1 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 2 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 3 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 4 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Viaarxiv icon

Jointly Learning Visual and Auditory Speech Representations from Raw Data

Add code
Dec 12, 2022
Viaarxiv icon