Alert button
Picture for Reuben Tan

Reuben Tan

Alert button

Socratis: Are large multimodal models emotionally aware?

Sep 05, 2023
Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko

Figure 1 for Socratis: Are large multimodal models emotionally aware?
Figure 2 for Socratis: Are large multimodal models emotionally aware?
Figure 3 for Socratis: Are large multimodal models emotionally aware?
Figure 4 for Socratis: Are large multimodal models emotionally aware?
Viaarxiv icon

Multiscale Video Pretraining for Long-Term Activity Forecasting

Jul 24, 2023
Reuben Tan, Matthias De Lange, Michael Iuzzolino, Bryan A. Plummer, Kate Saenko, Karl Ridgeway, Lorenzo Torresani

Figure 1 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 2 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 3 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 4 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Viaarxiv icon

EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video

Jul 11, 2023
Matthias De Lange, Hamid Eghbalzadeh, Reuben Tan, Michael Iuzzolino, Franziska Meier, Karl Ridgeway

Figure 1 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 2 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 3 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 4 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Viaarxiv icon

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Mar 28, 2023
Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko

Figure 1 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 2 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 3 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 4 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Viaarxiv icon

NewsStories: Illustrating articles with visual summaries

Aug 14, 2022
Reuben Tan, Bryan A. Plummer, Kate Saenko, JP Lewis, Avneesh Sud, Thomas Leung

Figure 1 for NewsStories: Illustrating articles with visual summaries
Figure 2 for NewsStories: Illustrating articles with visual summaries
Figure 3 for NewsStories: Illustrating articles with visual summaries
Viaarxiv icon

Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos

Oct 20, 2021
Reuben Tan, Bryan A. Plummer, Kate Saenko, Hailin Jin, Bryan Russell

Figure 1 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 2 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 3 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 4 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Viaarxiv icon

Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News

Sep 24, 2020
Reuben Tan, Bryan A. Plummer, Kate Saenko

Figure 1 for Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Figure 2 for Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Figure 3 for Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Figure 4 for Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News
Viaarxiv icon