Somak Aditya

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

Feb 27, 2024
Debrup Das, Debopriyo Banerjee, Somak Aditya, Ashish Kulkarni

GRAFFORD: A Benchmark Dataset for Testing the Knowledge of Object Affordances of Language and Vision Models

Feb 20, 2024
Sayantan Adak, Daivik Agrawal, Animesh Mukherjee, Somak Aditya

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Jan 18, 2024
Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych

Stuck in the Quicksand of Numeracy, Far from AGI Summit: Evaluating LLMs' Mathematical Competency through Ontology-guided Perturbations

Jan 17, 2024
Pengfei Hong, Deepanway Ghosal, Navonil Majumder, Somak Aditya, Rada Mihalcea, Soujanya Poria

Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models

Oct 02, 2023
Man Luo, Shrinidhi Kumbhar, Ming Shen, Mihir Parmar, Neeraj Varshney, Pratyay Banerjee, Somak Aditya, Chitta Baral

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks

May 24, 2023
Abhinav Rao, Sachin Vashistha, Atharva Naik, Somak Aditya, Monojit Choudhury

ReMask: A Robust Information-Masking Approach for Domain Counterfactual Generation

May 04, 2023
Pengfei Hong, Rishabh Bhardwaj, Navonil Majumder, Somak Aditya, Soujanya Poria

Generating Intermediate Steps for NLI with Next-Step Supervision

Aug 31, 2022
Deepanway Ghosal, Somak Aditya, Monojit Choudhury

Multilingual CheckList: Generation and Evaluation

Mar 30, 2022
Karthikeyan K, Shaily Bhatt, Pankaj Singh, Somak Aditya, Sandipan Dandapat, Sunayana Sitaram, Monojit Choudhury

LoNLI: An Extensible Framework for Testing Diverse Logical Reasoning Capabilities for NLI

Dec 04, 2021
Ishan Tarunesh, Somak Aditya, Monojit Choudhury
