Alert button
Picture for Reuben Tan

Reuben Tan

Alert button

Koala: Key frame-conditioned long video-LLM

Add code
Bookmark button
Alert button
Apr 19, 2024
Reuben Tan, Ximeng Sun, Ping Hu, Jui-hsien Wang, Hanieh Deilamsalehy, Bryan A. Plummer, Bryan Russell, Kate Saenko

Viaarxiv icon

Socratis: Are large multimodal models emotionally aware?

Add code
Bookmark button
Alert button
Sep 05, 2023
Katherine Deng, Arijit Ray, Reuben Tan, Saadia Gabriel, Bryan A. Plummer, Kate Saenko

Figure 1 for Socratis: Are large multimodal models emotionally aware?
Figure 2 for Socratis: Are large multimodal models emotionally aware?
Figure 3 for Socratis: Are large multimodal models emotionally aware?
Figure 4 for Socratis: Are large multimodal models emotionally aware?
Viaarxiv icon

Multiscale Video Pretraining for Long-Term Activity Forecasting

Add code
Bookmark button
Alert button
Jul 24, 2023
Reuben Tan, Matthias De Lange, Michael Iuzzolino, Bryan A. Plummer, Kate Saenko, Karl Ridgeway, Lorenzo Torresani

Figure 1 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 2 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 3 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Figure 4 for Multiscale Video Pretraining for Long-Term Activity Forecasting
Viaarxiv icon

EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video

Add code
Bookmark button
Alert button
Jul 11, 2023
Matthias De Lange, Hamid Eghbalzadeh, Reuben Tan, Michael Iuzzolino, Franziska Meier, Karl Ridgeway

Figure 1 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 2 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 3 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Figure 4 for EgoAdapt: A multi-stream evaluation study of adaptation to real-world egocentric user video
Viaarxiv icon

Language-Guided Audio-Visual Source Separation via Trimodal Consistency

Add code
Bookmark button
Alert button
Mar 28, 2023
Reuben Tan, Arijit Ray, Andrea Burns, Bryan A. Plummer, Justin Salamon, Oriol Nieto, Bryan Russell, Kate Saenko

Figure 1 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 2 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 3 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Figure 4 for Language-Guided Audio-Visual Source Separation via Trimodal Consistency
Viaarxiv icon

NewsStories: Illustrating articles with visual summaries

Add code
Bookmark button
Alert button
Aug 14, 2022
Reuben Tan, Bryan A. Plummer, Kate Saenko, JP Lewis, Avneesh Sud, Thomas Leung

Figure 1 for NewsStories: Illustrating articles with visual summaries
Figure 2 for NewsStories: Illustrating articles with visual summaries
Figure 3 for NewsStories: Illustrating articles with visual summaries
Viaarxiv icon

Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos

Add code
Bookmark button
Alert button
Oct 20, 2021
Reuben Tan, Bryan A. Plummer, Kate Saenko, Hailin Jin, Bryan Russell

Figure 1 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 2 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 3 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Figure 4 for Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos
Viaarxiv icon