Somak Aditya

REFINE-AF: A Task-Agnostic Framework to Align Language Models via Self-Generated Instructions using Reinforcement Learning from Automated Feedback

May 10, 2025
Via arXiv

SMAB: MAB based word Sensitivity Estimation Framework and its Applications in Adversarial Text Generation

Feb 10, 2025
Via arXiv

ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments

Oct 08, 2024
Via arXiv

Jailbreak Paradox: The Achilles' Heel of LLMs

Jun 18, 2024
Via arXiv

MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning

Feb 27, 2024
Via arXiv

GRAFFORD: A Benchmark Dataset for Testing the Knowledge of Object Affordances of Language and Vision Models

Feb 20, 2024
Via arXiv

Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs

Jan 18, 2024
Via arXiv

Stuck in the Quicksand of Numeracy, Far from AGI Summit: Evaluating LLMs' Mathematical Competency through Ontology-guided Perturbations

Jan 17, 2024
Via arXiv

Towards LogiGLUE: A Brief Survey and A Benchmark for Analyzing Logical Reasoning Capabilities of Language Models

Oct 02, 2023
Via arXiv

Tricking LLMs into Disobedience: Understanding, Analyzing, and Preventing Jailbreaks

May 24, 2023
Via arXiv