Alert button
Picture for Marcus Rohrbach

Marcus Rohrbach

Alert button

Efficient Pre-training for Localized Instruction Generation of Videos

Nov 27, 2023
Anil Batra, Davide Moltisanti, Laura Sevilla-Lara, Marcus Rohrbach, Frank Keller

Viaarxiv icon

Improving Selective Visual Question Answering by Learning from Your Peers

Jun 14, 2023
Corentin Dancette, Spencer Whitehead, Rishabh Maheshwary, Ramakrishna Vedantam, Stefan Scherer, Xinlei Chen, Matthieu Cord, Marcus Rohrbach

Figure 1 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 2 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 3 for Improving Selective Visual Question Answering by Learning from Your Peers
Figure 4 for Improving Selective Visual Question Answering by Learning from Your Peers
Viaarxiv icon

Simple Token-Level Confidence Improves Caption Correctness

May 11, 2023
Suzanne Petryk, Spencer Whitehead, Joseph E. Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach

Figure 1 for Simple Token-Level Confidence Improves Caption Correctness
Figure 2 for Simple Token-Level Confidence Improves Caption Correctness
Figure 3 for Simple Token-Level Confidence Improves Caption Correctness
Figure 4 for Simple Token-Level Confidence Improves Caption Correctness
Viaarxiv icon

Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

Jun 09, 2022
Shreyank N Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara

Figure 1 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 2 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 3 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Figure 4 for Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition
Viaarxiv icon

Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly

Apr 28, 2022
Spencer Whitehead, Suzanne Petryk, Vedaad Shakib, Joseph Gonzalez, Trevor Darrell, Anna Rohrbach, Marcus Rohrbach

Figure 1 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 2 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 3 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Figure 4 for Reliable Visual Question Answering: Abstain Rather Than Answer Incorrectly
Viaarxiv icon

Learning To Recognize Procedural Activities with Distant Supervision

Jan 26, 2022
Xudong Lin, Fabio Petroni, Gedas Bertasius, Marcus Rohrbach, Shih-Fu Chang, Lorenzo Torresani

Figure 1 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 2 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 3 for Learning To Recognize Procedural Activities with Distant Supervision
Figure 4 for Learning To Recognize Procedural Activities with Distant Supervision
Viaarxiv icon

FLAVA: A Foundational Language And Vision Alignment Model

Dec 08, 2021
Amanpreet Singh, Ronghang Hu, Vedanuj Goswami, Guillaume Couairon, Wojciech Galuba, Marcus Rohrbach, Douwe Kiela

Figure 1 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 2 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 3 for FLAVA: A Foundational Language And Vision Alignment Model
Figure 4 for FLAVA: A Foundational Language And Vision Alignment Model
Viaarxiv icon

A New Split for Evaluating True Zero-Shot Action Recognition

Jul 27, 2021
Shreyank N Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach

Figure 1 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 2 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 3 for A New Split for Evaluating True Zero-Shot Action Recognition
Figure 4 for A New Split for Evaluating True Zero-Shot Action Recognition
Viaarxiv icon

CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

Jan 18, 2021
Shreyank N Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach

Figure 1 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 2 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 3 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Figure 4 for CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition
Viaarxiv icon

KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA

Dec 20, 2020
Kenneth Marino, Xinlei Chen, Devi Parikh, Abhinav Gupta, Marcus Rohrbach

Figure 1 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 2 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 3 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Figure 4 for KRISP: Integrating Implicit and Symbolic Knowledge for Open-Domain Knowledge-Based VQA
Viaarxiv icon