Picture for Caroline Pantofaru

Caroline Pantofaru

Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers

Add code
May 31, 2017
Figure 1 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 2 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 3 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 4 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Viaarxiv icon

Egocentric Field-of-View Localization Using First-Person Point-of-View Devices

Add code
Oct 07, 2015
Figure 1 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 2 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 3 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 4 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Viaarxiv icon

Pose Embeddings: A Deep Architecture for Learning to Match Human Poses

Add code
Jul 01, 2015
Figure 1 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 2 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 3 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 4 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Viaarxiv icon