Picture for Stavros Petridis

Stavros Petridis

TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch

Add code
Oct 27, 2023
Figure 1 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 2 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 3 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Figure 4 for TorchAudio 2.1: Advancing speech recognition, self-supervised learning, and audio processing components for PyTorch
Viaarxiv icon

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

Add code
Jul 10, 2023
Figure 1 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 2 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 3 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 4 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Viaarxiv icon

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

Add code
May 15, 2023
Figure 1 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 2 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 3 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 4 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Viaarxiv icon

Is dataset condensation a silver bullet for healthcare data sharing?

Add code
May 05, 2023
Viaarxiv icon

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Add code
Apr 03, 2023
Viaarxiv icon

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Add code
Mar 25, 2023
Viaarxiv icon

Learning Cross-lingual Visual Speech Representations

Add code
Mar 14, 2023
Figure 1 for Learning Cross-lingual Visual Speech Representations
Figure 2 for Learning Cross-lingual Visual Speech Representations
Figure 3 for Learning Cross-lingual Visual Speech Representations
Figure 4 for Learning Cross-lingual Visual Speech Representations
Viaarxiv icon

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Add code
Jan 06, 2023
Figure 1 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 2 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 3 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 4 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Viaarxiv icon

Jointly Learning Visual and Auditory Speech Representations from Raw Data

Add code
Dec 12, 2022
Viaarxiv icon

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Add code
Nov 20, 2022
Viaarxiv icon