Title: Towards a Perceptual Model for Estimating the Quality of Visual Speech

Mar 24, 2022

Zakaria Aldeneh, Masha Fedzechkina, Skyler Seto, Katherine Metcalf, Miguel Sarabia, Nicholas Apostoloff, Barry-John Theobald

Abstract: Generating realistic lip motions to simulate speech production is key for driving natural character animations from audio. Previous research has shown that the traditional metrics used to optimize and assess models for generating lip motions from speech are not a good indicator of subjective opinion of animation quality. Yet running repeated subjective studies to assess the quality of animations can be time-consuming, and such studies are difficult to replicate. In this work, we seek to understand the relationship between perturbed lip motion and subjective opinion of lip motion quality. Specifically, we adjust the degree of articulation for lip motion sequences and run a user study to examine how this adjustment impacts the perceived quality of lip motion. We then train a model using the scores collected from our user study to automatically predict the subjective quality of an animated sequence. Our results show that (1) users score lip motions with slight over-articulation the highest in terms of perceptual quality; (2) under-articulation has a more detrimental effect on the perceived quality of lip motion than over-articulation; and (3) we can automatically estimate the subjective perceptual score for a given lip motion sequence with low error rates.

* Submitted to Interspeech 2022
