Alert button
Picture for Tanzila Rahman

Tanzila Rahman

Alert button

Visual Concept-driven Image Generation with Text-to-Image Diffusion Model

Add code
Bookmark button
Alert button
Feb 18, 2024
Tanzila Rahman, Shweta Mahajan, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Leonid Sigal

Viaarxiv icon

Prompting Hard or Hardly Prompting: Prompt Inversion for Text-to-Image Diffusion Models

Add code
Bookmark button
Alert button
Dec 19, 2023
Shweta Mahajan, Tanzila Rahman, Kwang Moo Yi, Leonid Sigal

Viaarxiv icon

Make-A-Story: Visual Memory Conditioned Consistent Story Generation

Add code
Bookmark button
Alert button
Nov 23, 2022
Tanzila Rahman, Hsin-Ying Lee, Jian Ren, Sergey Tulyakov, Shweta Mahajan, Leonid Sigal

Figure 1 for Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Figure 2 for Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Figure 3 for Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Figure 4 for Make-A-Story: Visual Memory Conditioned Consistent Story Generation
Viaarxiv icon

TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation

Add code
Bookmark button
Alert button
Oct 26, 2021
Tanzila Rahman, Mengyu Yang, Leonid Sigal

Figure 1 for TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation
Figure 2 for TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation
Figure 3 for TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation
Figure 4 for TriBERT: Full-body Human-centric Audio-visual Representation Learning for Visual Sound Separation
Viaarxiv icon

Weakly-supervised Audio-visual Sound Source Detection and Separation

Add code
Bookmark button
Alert button
Mar 25, 2021
Tanzila Rahman, Leonid Sigal

Figure 1 for Weakly-supervised Audio-visual Sound Source Detection and Separation
Figure 2 for Weakly-supervised Audio-visual Sound Source Detection and Separation
Figure 3 for Weakly-supervised Audio-visual Sound Source Detection and Separation
Figure 4 for Weakly-supervised Audio-visual Sound Source Detection and Separation
Viaarxiv icon

An Improved Attention for Visual Question Answering

Add code
Bookmark button
Alert button
Nov 07, 2020
Tanzila Rahman, Shih-Han Chou, Leonid Sigal, Giuseppe Carenini

Figure 1 for An Improved Attention for Visual Question Answering
Figure 2 for An Improved Attention for Visual Question Answering
Figure 3 for An Improved Attention for Visual Question Answering
Figure 4 for An Improved Attention for Visual Question Answering
Viaarxiv icon

Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning

Add code
Bookmark button
Alert button
Oct 25, 2019
Tanzila Rahman, Bicheng Xu, Leonid Sigal

Figure 1 for Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Figure 2 for Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Figure 3 for Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Figure 4 for Watch, Listen and Tell: Multi-modal Weakly Supervised Dense Event Captioning
Viaarxiv icon

Convolutional Temporal Attention Model for Video-based Person Re-identification

Add code
Bookmark button
Alert button
Apr 10, 2019
Tanzila Rahman, Mrigank Rochan, Yang Wang

Figure 1 for Convolutional Temporal Attention Model for Video-based Person Re-identification
Figure 2 for Convolutional Temporal Attention Model for Video-based Person Re-identification
Figure 3 for Convolutional Temporal Attention Model for Video-based Person Re-identification
Figure 4 for Convolutional Temporal Attention Model for Video-based Person Re-identification
Viaarxiv icon