Picture for Maja Pantic

Maja Pantic

Auto-AVSR: Audio-Visual Speech Recognition with Automatic Labels

Add code
Mar 25, 2023
Viaarxiv icon

Learning Cross-lingual Visual Speech Representations

Add code
Mar 14, 2023
Figure 1 for Learning Cross-lingual Visual Speech Representations
Figure 2 for Learning Cross-lingual Visual Speech Representations
Figure 3 for Learning Cross-lingual Visual Speech Representations
Figure 4 for Learning Cross-lingual Visual Speech Representations
Viaarxiv icon

Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation

Add code
Jan 06, 2023
Figure 1 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 2 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 3 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Figure 4 for Diffused Heads: Diffusion Models Beat GANs on Talking-Face Generation
Viaarxiv icon

Jointly Learning Visual and Auditory Speech Representations from Raw Data

Add code
Dec 12, 2022
Viaarxiv icon

LA-VocE: Low-SNR Audio-visual Speech Enhancement using Neural Vocoders

Add code
Nov 20, 2022
Viaarxiv icon

FAN-Trans: Online Knowledge Distillation for Facial Action Unit Detection

Add code
Nov 11, 2022
Viaarxiv icon

Streaming Audio-Visual Speech Recognition with Alignment Regularization

Add code
Nov 03, 2022
Viaarxiv icon

SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video

Add code
Oct 20, 2022
Figure 1 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 2 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 3 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Figure 4 for SS-VAERR: Self-Supervised Apparent Emotional Reaction Recognition from Video
Viaarxiv icon

Training Strategies for Improved Lip-reading

Add code
Sep 03, 2022
Figure 1 for Training Strategies for Improved Lip-reading
Figure 2 for Training Strategies for Improved Lip-reading
Figure 3 for Training Strategies for Improved Lip-reading
Figure 4 for Training Strategies for Improved Lip-reading
Viaarxiv icon

SVTS: Scalable Video-to-Speech Synthesis

Add code
May 04, 2022
Figure 1 for SVTS: Scalable Video-to-Speech Synthesis
Figure 2 for SVTS: Scalable Video-to-Speech Synthesis
Figure 3 for SVTS: Scalable Video-to-Speech Synthesis
Figure 4 for SVTS: Scalable Video-to-Speech Synthesis
Viaarxiv icon