Alert button
Picture for Caroline Pantofaru

Caroline Pantofaru

Alert button

AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection

Add code
Bookmark button
Alert button
Jan 05, 2019
Joseph Roth, Sourish Chaudhuri, Ondrej Klejch, Radhika Marvin, Andrew Gallagher, Liat Kaver, Sharadh Ramaswamy, Arkadiusz Stopczynski, Cordelia Schmid, Zhonghua Xi, Caroline Pantofaru

Figure 1 for AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Figure 2 for AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Figure 3 for AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Figure 4 for AVA-ActiveSpeaker: An Audio-Visual Dataset for Active Speaker Detection
Viaarxiv icon

AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions

Add code
Bookmark button
Alert button
Apr 30, 2018
Chunhui Gu, Chen Sun, David A. Ross, Carl Vondrick, Caroline Pantofaru, Yeqing Li, Sudheendra Vijayanarasimhan, George Toderici, Susanna Ricco, Rahul Sukthankar, Cordelia Schmid, Jitendra Malik

Figure 1 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 2 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 3 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Figure 4 for AVA: A Video Dataset of Spatio-temporally Localized Atomic Visual Actions
Viaarxiv icon

Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers

Add code
Bookmark button
Alert button
May 31, 2017
Ken Hoover, Sourish Chaudhuri, Caroline Pantofaru, Malcolm Slaney, Ian Sturdy

Figure 1 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 2 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 3 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Figure 4 for Putting a Face to the Voice: Fusing Audio and Visual Signals Across a Video to Determine Speakers
Viaarxiv icon

Egocentric Field-of-View Localization Using First-Person Point-of-View Devices

Add code
Bookmark button
Alert button
Oct 07, 2015
Vinay Bettadapura, Irfan Essa, Caroline Pantofaru

Figure 1 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 2 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 3 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Figure 4 for Egocentric Field-of-View Localization Using First-Person Point-of-View Devices
Viaarxiv icon

Pose Embeddings: A Deep Architecture for Learning to Match Human Poses

Add code
Bookmark button
Alert button
Jul 01, 2015
Greg Mori, Caroline Pantofaru, Nisarg Kothari, Thomas Leung, George Toderici, Alexander Toshev, Weilong Yang

Figure 1 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 2 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 3 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Figure 4 for Pose Embeddings: A Deep Architecture for Learning to Match Human Poses
Viaarxiv icon