Tadas Baltrusaitis

SCAMPS: Synthetics for Camera Measurement of Physiological Signals

Jun 08, 2022

Daniel McDuff, Miah Wander, Xin Liu, Brian L. Hill, Javier Hernandez, Jonathan Lester, Tadas Baltrusaitis

Abstract: The use of cameras and computational algorithms for noninvasive, low-cost and scalable measurement of physiological (e.g., cardiac and pulmonary) vital signs is very attractive. However, diverse data representing a range of environments, body motions, illumination conditions and physiological states are laborious, time-consuming and expensive to obtain. Synthetic data have proven a valuable tool in several areas of machine learning, yet are not widely available for camera measurement of physiological states. Synthetic data offer "perfect" labels (e.g., without noise and with precise synchronization), labels that may not be possible to obtain otherwise (e.g., precise pixel-level segmentation maps) and provide a high degree of control over variation and diversity in the dataset. We present SCAMPS, a dataset of synthetics containing 2,800 videos (1.68M frames) with aligned cardiac and respiratory signals and facial action intensities. The RGB frames are provided alongside segmentation maps. We provide precise descriptive statistics about the underlying waveforms, including inter-beat interval, heart rate variability, and pulse arrival time. Finally, we present baseline results training on these synthetic data and testing on real-world datasets to illustrate generalizability.
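The waveform statistics named in this abstract (inter-beat interval and heart rate variability) can be illustrated with a minimal sketch. This is not code from the SCAMPS dataset or its authors: the function names and the beat timestamps are hypothetical, and RMSSD is used here as one common heart rate variability measure, which the abstract does not specify.

```python
from math import sqrt


def inter_beat_intervals(beat_times_s):
    """Return the gaps between successive beat timestamps, in seconds."""
    return [later - earlier
            for earlier, later in zip(beat_times_s, beat_times_s[1:])]


def mean_heart_rate_bpm(ibis_s):
    """Mean heart rate in beats per minute from inter-beat intervals."""
    return 60.0 / (sum(ibis_s) / len(ibis_s))


def rmssd_ms(ibis_s):
    """Root mean square of successive interval differences, in milliseconds.

    RMSSD is an assumed choice of HRV statistic for this illustration.
    """
    diffs_ms = [(b - a) * 1000.0 for a, b in zip(ibis_s, ibis_s[1:])]
    return sqrt(sum(d * d for d in diffs_ms) / len(diffs_ms))


# Hypothetical beat timestamps (seconds), invented for this example.
beat_times = [0.0, 0.8, 1.7, 2.5, 3.4]
ibis = inter_beat_intervals(beat_times)

print(round(mean_heart_rate_bpm(ibis), 1))  # 70.6
print(round(rmssd_ms(ibis), 1))             # 100.0
```

With synthetic data the beat timestamps are known exactly, so statistics like these can be computed without the peak-detection noise that affects recorded signals.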

3D face reconstruction with dense landmarks

Apr 06, 2022

Erroll Wood, Tadas Baltrusaitis, Charlie Hewitt, Matthew Johnson, Jingjing Shen, Nikola Milosavljevic, Daniel Wilde, Stephan Garbin, Toby Sharp, Ivan Stojiljkovic (+2 more)

Abstract: Landmarks often play a key role in face analysis, but many aspects of identity or expression cannot be represented by sparse landmarks alone. Thus, in order to reconstruct faces more accurately, landmarks are often combined with additional signals like depth images or techniques like differentiable rendering. Can we keep things simple by just using more landmarks? In answer, we present the first method that accurately predicts 10x as many landmarks as usual, covering the whole head, including the eyes and teeth. This is accomplished using synthetic training data, which guarantees perfect landmark annotations. By fitting a morphable model to these dense landmarks, we achieve state-of-the-art results for monocular 3D face reconstruction in the wild. We show that dense landmarks are an ideal signal for integrating face shape information across frames by demonstrating accurate and expressive facial performance capture in both monocular and multi-view scenarios. This approach is also highly efficient: we can predict dense landmarks and fit our 3D face model at over 150 FPS on a single CPU thread.

Synthetic Data for Multi-Parameter Camera-Based Physiological Sensing

Oct 10, 2021

Daniel McDuff, Xin Liu, Javier Hernandez, Erroll Wood, Tadas Baltrusaitis

Abstract: Synthetic data is a powerful tool in training data-hungry deep learning algorithms. However, to date, camera-based physiological sensing has not taken full advantage of these techniques. In this work, we leverage a high-fidelity synthetics pipeline for generating videos of faces with faithful blood flow and breathing patterns. We present systematic experiments showing how physiologically-grounded synthetic data can be used in training camera-based multi-parameter cardiopulmonary sensing. We provide empirical evidence that heart and breathing rate measurement accuracy increases with the number of synthetic avatars in the training set. Furthermore, training with avatars with darker skin types leads to better overall performance than training with avatars with lighter skin types. Finally, we discuss the opportunities that synthetics present in the domain of camera-based physiological sensing and limitations that need to be overcome.

Advancing Non-Contact Vital Sign Measurement using Synthetic Avatars

Oct 24, 2020

Daniel McDuff, Javier Hernandez, Erroll Wood, Xin Liu, Tadas Baltrusaitis

Abstract: Non-contact physiological measurement has the potential to provide low-cost, non-invasive health monitoring. However, machine vision approaches are often limited by the availability and diversity of annotated video datasets, resulting in poor generalization to complex real-life conditions. To address these challenges, this work proposes the use of synthetic avatars that display facial blood flow changes and allow for systematic generation of samples under a wide variety of conditions. Our results show that training on both simulated and real video data can lead to performance gains under challenging conditions. We show state-of-the-art performance on three large benchmark datasets and improved robustness to skin type and motion.

A high fidelity synthetic face framework for computer vision

Jul 16, 2020

Tadas Baltrusaitis, Erroll Wood, Virginia Estellers, Charlie Hewitt, Sebastian Dziadzio, Marek Kowalski, Matthew Johnson, Thomas J. Cashman, Jamie Shotton

Abstract: Analysis of faces is one of the core applications of computer vision, with tasks including landmark alignment, head pose estimation, expression recognition, and face recognition, among others. However, building reliable methods requires time-consuming data collection and often even more time-consuming manual annotation, which can be unreliable. In our work we propose synthesizing such facial data, including ground truth annotations that would be almost impossible to acquire through manual annotation at the consistency and scale possible through use of synthetic data. We use a parametric face model together with handcrafted assets, which enable us to generate training data with unprecedented quality and diversity (varying shape, texture, expression, pose, lighting, and hair).

Hand2Face: Automatic Synthesis and Recognition of Hand Over Face Occlusions

Aug 17, 2017

Behnaz Nojavanasghari, Charles E. Hughes, Tadas Baltrusaitis, Louis-Philippe Morency

Abstract: A person's face discloses important information about their affective state. Although there has been extensive research on recognition of facial expressions, the performance of existing approaches is challenged by facial occlusions. Facial occlusions are often treated as noise and discarded in recognition of affective states. However, hand over face occlusions can provide additional information for recognition of some affective states such as curiosity, frustration and boredom. One of the reasons that this problem has not gained attention is the lack of naturalistic occluded faces that contain hand over face occlusions as well as other types of occlusions. Traditional approaches for obtaining affective data are time-consuming and expensive, which limits researchers in affective computing to working on small datasets. This limitation affects the generalizability of models and prevents researchers from taking advantage of recent advances in deep learning that have shown great success in many fields but require large volumes of data. In this paper, we first introduce a novel framework for synthesizing naturalistic facial occlusions from an initial dataset of non-occluded faces and separate images of hands, reducing the costly process of data collection and annotation. We then propose a model for facial occlusion type recognition to differentiate between hand over face occlusions and other types of occlusions such as scarves, hair, glasses and objects. Finally, we present a model to localize hand over face occlusions and identify the occluded regions of the face.

* Accepted to International Conference on Affective Computing and Intelligent Interaction (ACII), 2017

GazeDirector: Fully Articulated Eye Gaze Redirection in Video

Apr 27, 2017

Erroll Wood, Tadas Baltrusaitis, Louis-Philippe Morency, Peter Robinson, Andreas Bulling

Abstract: We present GazeDirector, a new approach for eye gaze redirection that uses model-fitting. Our method first tracks the eyes by fitting a multi-part eye region model to video frames using analysis-by-synthesis, thereby recovering eye region shape, texture, pose, and gaze simultaneously. It then redirects gaze by 1) warping the eyelids from the original image using a model-derived flow field, and 2) rendering and compositing synthesized 3D eyeballs onto the output image in a photorealistic manner. GazeDirector allows us to change where people are looking without person-specific training data, and with full articulation, i.e., we can precisely specify new gaze directions in 3D. Quantitatively, we evaluate both model-fitting and gaze synthesis, with experiments for gaze estimation and redirection on the Columbia gaze dataset. Qualitatively, we compare GazeDirector against recent work on gaze redirection, showing better results especially for large redirection angles. Finally, we demonstrate gaze redirection on YouTube videos by introducing new 3D gaze targets and by manipulating visual behavior.
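To make the idea of specifying a gaze direction through a 3D target concrete, here is a small geometric sketch. It is not GazeDirector's implementation: the function names, the coordinate convention (x to the right, y up, z forward from the eye) and the example points are all assumptions made for illustration.

```python
from math import asin, atan2, degrees, sqrt


def gaze_direction(eye_center, target):
    """Unit vector pointing from the eyeball center to a 3D gaze target."""
    delta = [t - e for e, t in zip(eye_center, target)]
    norm = sqrt(sum(c * c for c in delta))
    if norm == 0.0:
        raise ValueError("target coincides with the eye center")
    return [c / norm for c in delta]


def yaw_pitch_degrees(direction):
    """Convert a unit gaze vector to (yaw, pitch) in degrees.

    Assumed convention: yaw is rotation about the vertical axis measured
    from +z toward +x, pitch is elevation toward +y.
    """
    x, y, z = direction
    yaw = degrees(atan2(x, z))
    # Clamp to guard against rounding slightly outside [-1, 1].
    pitch = degrees(asin(max(-1.0, min(1.0, y))))
    return yaw, pitch


# Hypothetical eye position and gaze target in arbitrary units.
direction = gaze_direction((0.0, 0.0, 0.0), (1.0, 0.0, 1.0))
yaw, pitch = yaw_pitch_degrees(direction)
print(round(yaw, 1), round(pitch, 1))  # 45.0 0.0
```

A redirection system would then need to render the eyeball so that its optical axis matches this vector, which is the part the abstract attributes to model fitting and compositing.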

Rendering of Eyes for Eye-Shape Registration and Gaze Estimation

May 21, 2015

Erroll Wood, Tadas Baltrusaitis, Xucong Zhang, Yusuke Sugano, Peter Robinson, Andreas Bulling

Abstract: Images of the eye are key in several computer vision problems, such as shape registration and gaze estimation. Recent large-scale supervised methods for these problems require time-consuming data collection and manual annotation, which can be unreliable. We propose synthesizing perfectly labelled photo-realistic training data in a fraction of the time. We used computer graphics techniques to build a collection of dynamic eye-region models from head scan geometry. These were randomly posed to synthesize close-up eye images for a wide range of head poses, gaze directions, and illumination conditions. We used our model's controllability to verify the importance of realistic illumination and shape variations in eye-region training data. Finally, we demonstrate the benefits of our synthesized training data (SynthesEyes) by outperforming state-of-the-art methods for eye-shape registration as well as cross-dataset appearance-based gaze estimation in the wild.