Ajay Divakaran

BloomVQA: Assessing Hierarchical Multi-modal Comprehension

Dec 20, 2023
Yunye Gong, Robik Shrestha, Jared Claypoole, Michael Cogswell, Arijit Ray, Christopher Kanan, Ajay Divakaran

Via arXiv

A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval

Nov 30, 2023
Matthew Gwilliam, Michael Cogswell, Meng Ye, Karan Sikka, Abhinav Shrivastava, Ajay Divakaran

Via arXiv

DRESS: Instructing Large Vision-Language Models to Align and Interact with Humans via Natural Language Feedback

Nov 16, 2023
Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran

Via arXiv

Demonstrations Are All You Need: Advancing Offensive Content Paraphrasing using In-Context Learning

Oct 16, 2023
Anirudh Som, Karan Sikka, Helen Gent, Ajay Divakaran, Andreas Kathol, Dimitra Vergyri

Via arXiv

Confidence Calibration for Systems with Cascaded Predictive Modules

Sep 21, 2023
Yunye Gong, Yi Yao, Xiao Lin, Ajay Divakaran, Melinda Gervasio

Via arXiv

Measuring and Improving Chain-of-Thought Reasoning in Vision-Language Models

Sep 08, 2023
Yangyi Chen, Karan Sikka, Michael Cogswell, Heng Ji, Ajay Divakaran

Via arXiv

TIJO: Trigger Inversion with Joint Optimization for Defending Multimodal Backdoored Models

Aug 07, 2023
Indranil Sur, Karan Sikka, Matthew Walmer, Kaushik Koneripalli, Anirban Roy, Xiao Lin, Ajay Divakaran, Susmit Jha

Via arXiv

Probing Conceptual Understanding of Large Visual-Language Models

Apr 07, 2023
Madeline Chantry Schiappa, Michael Cogswell, Ajay Divakaran, Yogesh Singh Rawat

Via arXiv

Multilingual Content Moderation: A Case Study on Reddit

Feb 19, 2023
Meng Ye, Karan Sikka, Katherine Atwell, Sabit Hassan, Ajay Divakaran, Malihe Alikhani

Via arXiv

System Design for an Integrated Lifelong Reinforcement Learning Agent for Real-Time Strategy Games

Dec 08, 2022
Indranil Sur, Zachary Daniels, Abrar Rahman, Kamil Faber, Gianmarco J. Gallardo, Tyler L. Hayes, Cameron E. Taylor, Mustafa Burak Gurbuz, James Smith, Sahana Joshi, Nathalie Japkowicz, Michael Baron, Zsolt Kira, Christopher Kanan, Roberto Corizzo, Ajay Divakaran, Michael Piacentino, Jesse Hostetler, Aswin Raghavan

Via arXiv