Alert button
Picture for Babak Damavandi

Babak Damavandi

Alert button

SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM

Add code
Bookmark button
Alert button
Mar 07, 2024
Jielin Qiu, Andrea Madotto, Zhaojiang Lin, Paul A. Crook, Yifan Ethan Xu, Xin Luna Dong, Christos Faloutsos, Lei Li, Babak Damavandi, Seungwhan Moon

Figure 1 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 2 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 3 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Figure 4 for SnapNTell: Enhancing Entity-Centric Visual Question Answering with Retrieval Augmented Multimodal LLM
Viaarxiv icon

AnyMAL: An Efficient and Scalable Any-Modality Augmented Language Model

Add code
Bookmark button
Alert button
Sep 27, 2023
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Tushar Nagarajan, Matt Smith, Shashank Jain, Chun-Fu Yeh, Prakash Murugesan, Peyman Heidari, Yue Liu, Kavya Srinet, Babak Damavandi, Anuj Kumar

Viaarxiv icon

Navigating Connected Memories with a Task-oriented Dialog System

Add code
Bookmark button
Alert button
Nov 15, 2022
Seungwhan Moon, Satwik Kottur, Alborz Geramifard, Babak Damavandi

Figure 1 for Navigating Connected Memories with a Task-oriented Dialog System
Figure 2 for Navigating Connected Memories with a Task-oriented Dialog System
Figure 3 for Navigating Connected Memories with a Task-oriented Dialog System
Figure 4 for Navigating Connected Memories with a Task-oriented Dialog System
Viaarxiv icon

Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation

Add code
Bookmark button
Alert button
Nov 08, 2022
Satwik Kottur, Seungwhan Moon, Aram H. Markosyan, Hardik Shah, Babak Damavandi, Alborz Geramifard

Figure 1 for Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation
Figure 2 for Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation
Figure 3 for Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation
Figure 4 for Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation
Viaarxiv icon

IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text

Add code
Bookmark button
Alert button
Oct 26, 2022
Seungwhan Moon, Andrea Madotto, Zhaojiang Lin, Alireza Dirafzoon, Aparajita Saraf, Amy Bearman, Babak Damavandi

Figure 1 for IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text
Figure 2 for IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text
Figure 3 for IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text
Figure 4 for IMU2CLIP: Multimodal Contrastive Learning for IMU Motion Sensors from Egocentric Videos and Text
Viaarxiv icon

Connecting What to Say With Where to Look by Modeling Human Attention Traces

Add code
Bookmark button
Alert button
May 12, 2021
Zihang Meng, Licheng Yu, Ning Zhang, Tamara Berg, Babak Damavandi, Vikas Singh, Amy Bearman

Figure 1 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 2 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 3 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Figure 4 for Connecting What to Say With Where to Look by Modeling Human Attention Traces
Viaarxiv icon

SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations

Add code
Bookmark button
Alert button
Apr 18, 2021
Satwik Kottur, Seungwhan Moon, Alborz Geramifard, Babak Damavandi

Figure 1 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 2 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 3 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Figure 4 for SIMMC 2.0: A Task-oriented Dialog Dataset for Immersive Multimodal Conversations
Viaarxiv icon

NN-grams: Unifying neural network and n-gram language models for Speech Recognition

Add code
Bookmark button
Alert button
Jun 23, 2016
Babak Damavandi, Shankar Kumar, Noam Shazeer, Antoine Bruguier

Figure 1 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 2 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 3 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Figure 4 for NN-grams: Unifying neural network and n-gram language models for Speech Recognition
Viaarxiv icon