Alert button
Picture for Maja Pantic

Maja Pantic

Alert button

Jointly Learning Visual and Auditory Speech Representations from Raw Data

Add code
Bookmark button
Alert button
Dec 12, 2022
Alexandros Haliassos, Pingchuan Ma, Rodrigo Mira, Stavros Petridis, Maja Pantic

Figure 1 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 2 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 3 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Figure 4 for Jointly Learning Visual and Auditory Speech Representations from Raw Data
Viaarxiv icon

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Add code
Bookmark button
Alert button
Nov 20, 2022
Rodrigo Mira, Buye Xu, Jacob Donley, Anurag Kumar, Stavros Petridis, Vamsi Krishna Ithapu, Maja Pantic

Figure 1 for LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Figure 2 for LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Figure 3 for LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Figure 4 for LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders
Viaarxiv icon

FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection

Add code
Bookmark button
Alert button
Nov 11, 2022
Jing Yang, Jie Shen, Yiming Lin, Yordan Hristov, Maja Pantic

Figure 1 for FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection
Figure 2 for FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection
Figure 3 for FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection
Figure 4 for FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection
Viaarxiv icon

Streaming Audio-Visual Speech Recognition with Alignment Regularization

Add code
Bookmark button
Alert button
Nov 03, 2022
Pingchuan Ma, Niko Moritz, Stavros Petridis, Christian Fuegen, Maja Pantic

Figure 1 for Streaming Audio-Visual Speech Recognition with Alignment Regularization
Figure 2 for Streaming Audio-Visual Speech Recognition with Alignment Regularization
Figure 3 for Streaming Audio-Visual Speech Recognition with Alignment Regularization
Figure 4 for Streaming Audio-Visual Speech Recognition with Alignment Regularization
Viaarxiv icon

SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video

Add code
Bookmark button
Alert button
Oct 20, 2022
Marija Jegorova, Stavros Petridis, Maja Pantic

Figure 1 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 2 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 3 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 4 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Viaarxiv icon

Training Strategies for Improved Lip-reading

Add code
Bookmark button
Alert button
Sep 03, 2022
Pingchuan Ma, Yujiang Wang, Stavros Petridis, Jie Shen, Maja Pantic

Figure 1 for Training Strategies for Improved Lip-reading
Figure 2 for Training Strategies for Improved Lip-reading
Figure 3 for Training Strategies for Improved Lip-reading
Figure 4 for Training Strategies for Improved Lip-reading
Viaarxiv icon

SVTS: Scalable Video-to-Speech Synthesis

Add code
Bookmark button
Alert button
May 04, 2022
Rodrigo Mira, Alexandros Haliassos, Stavros Petridis, Björn W. Schuller, Maja Pantic

Figure 1 for SVTS: Scalable Video-to-Speech Synthesis
Figure 2 for SVTS: Scalable Video-to-Speech Synthesis
Figure 3 for SVTS: Scalable Video-to-Speech Synthesis
Figure 4 for SVTS: Scalable Video-to-Speech Synthesis
Viaarxiv icon

Self-supervised Video-centralised Transformer for Video Face Clustering

Add code
Bookmark button
Alert button
Mar 24, 2022
Yujiang Wang, Mingzhi Dong, Jie Shen, Yiming Luo, Yiming Lin, Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Self-supervised Video-centralised Transformer for Video Face Clustering
Figure 2 for Self-supervised Video-centralised Transformer for Video Face Clustering
Figure 3 for Self-supervised Video-centralised Transformer for Video Face Clustering
Figure 4 for Self-supervised Video-centralised Transformer for Video Face Clustering
Viaarxiv icon

Visual Speech Recognition for Multiple Languages in the Wild

Add code
Bookmark button
Alert button
Feb 26, 2022
Pingchuan Ma, Stavros Petridis, Maja Pantic

Figure 1 for Visual Speech Recognition for Multiple Languages in the Wild
Figure 2 for Visual Speech Recognition for Multiple Languages in the Wild
Figure 3 for Visual Speech Recognition for Multiple Languages in the Wild
Figure 4 for Visual Speech Recognition for Multiple Languages in the Wild
Viaarxiv icon

Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection

Add code
Bookmark button
Alert button
Jan 18, 2022
Alexandros Haliassos, Rodrigo Mira, Stavros Petridis, Maja Pantic

Figure 1 for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
Figure 2 for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
Figure 3 for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
Figure 4 for Leveraging Real Talking Faces via Self-Supervision for Robust Forgery Detection
Viaarxiv icon