Alert button
Picture for Maja Pantic

Maja Pantic

Alert button

BRAVEn: Improving Self-Supervised Pre-training for Visual and Auditory Speech Recognition

Add code
Bookmark button
Alert button
Apr 02, 2024
Alexandros Haliassos, Andreas Zinonos, Rodrigo Mira, Stavros Petridis, Maja Pantic

Viaarxiv icon

Audio-visual video-to-speech synthesis with synthesized input audio

Add code
Bookmark button
Alert button
Jul 31, 2023
Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

Figure 1 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 2 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 3 for Audio-visual video-to-speech synthesis with synthesized input audio
Figure 4 for Audio-visual video-to-speech synthesis with synthesized input audio
Viaarxiv icon

SparseVSR: Lightweight and Noise Robust Visual Speech Recognition

Add code
Bookmark button
Alert button
Jul 10, 2023
Adriana Fernandez-Lopez, Honglie Chen, Pingchuan Ma, Alexandros Haliassos, Stavros Petridis, Maja Pantic

Figure 1 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 2 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 3 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Figure 4 for SparseVSR: Lightweight and Noise Robust Visual Speech Recognition
Viaarxiv icon

Large-scale unsupervised audio pre-training for video-to-speech synthesis

Add code
Bookmark button
Alert button
Jun 27, 2023
Triantafyllos Kefalas, Yannis Panagakis, Maja Pantic

Figure 1 for Large-scale unsupervised audio pre-training for video-to-speech synthesis
Figure 2 for Large-scale unsupervised audio pre-training for video-to-speech synthesis
Figure 3 for Large-scale unsupervised audio pre-training for video-to-speech synthesis
Figure 4 for Large-scale unsupervised audio pre-training for video-to-speech synthesis
Viaarxiv icon

Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models

Add code
Bookmark button
Alert button
May 15, 2023
Antoni Bigata Casademunt, Rodrigo Mira, Nikita Drobyshev, Konstantinos Vougioukas, Stavros Petridis, Maja Pantic

Figure 1 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 2 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 3 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Figure 4 for Laughing Matters: Introducing Laughing-Face Generation using Diffusion Models
Viaarxiv icon

SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision

Add code
Bookmark button
Alert button
Apr 03, 2023
Xubo Liu, Egor Lakomkin, Konstantinos Vougioukas, Pingchuan Ma, Honglie Chen, Ruiming Xie, Morrie Doulaty, Niko Moritz, Jáchym Kolář, Stavros Petridis, Maja Pantic, Christian Fuegen

Figure 1 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 2 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 3 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Figure 4 for SynthVSR: Scaling Up Visual Speech Recognition With Synthetic Supervision
Viaarxiv icon

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Add code
Bookmark button
Alert button
Mar 25, 2023
Pingchuan Ma, Alexandros Haliassos, Adriana Fernandez-Lopez, Honglie Chen, Stavros Petridis, Maja Pantic

Figure 1 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 2 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 3 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Figure 4 for Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels
Viaarxiv icon

Learning Cross-lingual Visual Speech Representations

Add code
Bookmark button
Alert button
Mar 14, 2023
Andreas Zinonos, Alexandros Haliassos, Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Learning Cross-lingual Visual Speech Representations
Figure 2 for Learning Cross-lingual Visual Speech Representations
Figure 3 for Learning Cross-lingual Visual Speech Representations
Figure 4 for Learning Cross-lingual Visual Speech Representations
Viaarxiv icon

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Add code
Bookmark button
Alert button
Jan 06, 2023
Michał Stypułkowski, Konstantinos Vougioukas, Sen He, Maciej Zięba, Stavros Petridis, Maja Pantic

Figure 1 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 2 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 3 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 4 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Viaarxiv icon