Alert button
Picture for Tamara L. Berg

Tamara L. Berg

Alert button

Revealing Single Frame Bias for Video-and-Language Learning

Add code
Bookmark button
Alert button
Jun 07, 2022
Jie Lei, Tamara L. Berg, Mohit Bansal

Figure 1 for Revealing Single Frame Bias for Video-and-Language Learning
Figure 2 for Revealing Single Frame Bias for Video-and-Language Learning
Figure 3 for Revealing Single Frame Bias for Video-and-Language Learning
Figure 4 for Revealing Single Frame Bias for Video-and-Language Learning
Viaarxiv icon

End-to-End Visual Editing with a Generatively Pre-Trained Artist

Add code
Bookmark button
Alert button
May 03, 2022
Andrew Brown, Cheng-Yang Fu, Omkar Parkhi, Tamara L. Berg, Andrea Vedaldi

Figure 1 for End-to-End Visual Editing with a Generatively Pre-Trained Artist
Figure 2 for End-to-End Visual Editing with a Generatively Pre-Trained Artist
Figure 3 for End-to-End Visual Editing with a Generatively Pre-Trained Artist
Figure 4 for End-to-End Visual Editing with a Generatively Pre-Trained Artist
Viaarxiv icon

LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval

Add code
Bookmark button
Alert button
Mar 10, 2022
Jie Lei, Xinlei Chen, Ning Zhang, Mengjiao Wang, Mohit Bansal, Tamara L. Berg, Licheng Yu

Figure 1 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 2 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 3 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Figure 4 for LoopITR: Combining Dual and Cross Encoder Architectures for Image-Text Retrieval
Viaarxiv icon

CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval

Add code
Bookmark button
Alert button
Feb 15, 2022
Licheng Yu, Jun Chen, Animesh Sinha, Mengjiao MJ Wang, Hugo Chen, Tamara L. Berg, Ning Zhang

Figure 1 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 2 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 3 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Figure 4 for CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval
Viaarxiv icon

MTVR: Multilingual Moment Retrieval in Videos

Add code
Bookmark button
Alert button
Jul 30, 2021
Jie Lei, Tamara L. Berg, Mohit Bansal

Figure 1 for MTVR: Multilingual Moment Retrieval in Videos
Figure 2 for MTVR: Multilingual Moment Retrieval in Videos
Figure 3 for MTVR: Multilingual Moment Retrieval in Videos
Figure 4 for MTVR: Multilingual Moment Retrieval in Videos
Viaarxiv icon

QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries

Add code
Bookmark button
Alert button
Jul 20, 2021
Jie Lei, Tamara L. Berg, Mohit Bansal

Figure 1 for QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Figure 2 for QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Figure 3 for QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Figure 4 for QVHighlights: Detecting Moments and Highlights in Videos via Natural Language Queries
Viaarxiv icon

Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling

Add code
Bookmark button
Alert button
Feb 11, 2021
Jie Lei, Linjie Li, Luowei Zhou, Zhe Gan, Tamara L. Berg, Mohit Bansal, Jingjing Liu

Figure 1 for Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Figure 2 for Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Figure 3 for Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Figure 4 for Less is More: ClipBERT for Video-and-Language Learning via Sparse Sampling
Viaarxiv icon

What is More Likely to Happen Next? Video-and-Language Future Event Prediction

Add code
Bookmark button
Alert button
Oct 15, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

Figure 1 for What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Figure 2 for What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Figure 3 for What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Figure 4 for What is More Likely to Happen Next? Video-and-Language Future Event Prediction
Viaarxiv icon

MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning

Add code
Bookmark button
Alert button
May 11, 2020
Jie Lei, Liwei Wang, Yelong Shen, Dong Yu, Tamara L. Berg, Mohit Bansal

Figure 1 for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Figure 2 for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Figure 3 for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Figure 4 for MART: Memory-Augmented Recurrent Transformer for Coherent Video Paragraph Captioning
Viaarxiv icon

TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval

Add code
Bookmark button
Alert button
Jan 24, 2020
Jie Lei, Licheng Yu, Tamara L. Berg, Mohit Bansal

Figure 1 for TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Figure 2 for TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Figure 3 for TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Figure 4 for TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval
Viaarxiv icon