Picture for Sarkar Snigdha Sarathi Das

Sarkar Snigdha Sarathi Das

Training Step-Level Reasoning Verifiers with Formal Verification Tools

Add code
May 21, 2025
Viaarxiv icon

HRScene: How Far Are VLMs from Effective High-Resolution Image Understanding?

Add code
Apr 29, 2025
Viaarxiv icon

GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization

Add code
Apr 04, 2025
Figure 1 for GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
Figure 2 for GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
Figure 3 for GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
Figure 4 for GREATERPROMPT: A Unified, Customizable, and High-Performing Open-Source Toolkit for Prompt Optimization
Viaarxiv icon

Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet

Add code
Feb 07, 2025
Figure 1 for Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Figure 2 for Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Figure 3 for Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Figure 4 for Can LLMs Rank the Harmfulness of Smaller LLMs? We are Not There Yet
Viaarxiv icon

GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers

Add code
Dec 12, 2024
Figure 1 for GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Figure 2 for GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Figure 3 for GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Figure 4 for GReaTer: Gradients over Reasoning Makes Smaller Language Models Strong Prompt Optimizers
Viaarxiv icon

VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information

Add code
Dec 01, 2024
Figure 1 for VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information
Figure 2 for VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information
Figure 3 for VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information
Figure 4 for VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception of Geometric Information
Viaarxiv icon

Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models

Add code
Nov 12, 2024
Figure 1 for Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
Figure 2 for Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
Figure 3 for Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
Figure 4 for Verbosity $ eq$ Veracity: Demystify Verbosity Compensation Behavior of Large Language Models
Viaarxiv icon

Evaluating LLMs at Detecting Errors in LLM Responses

Add code
Apr 04, 2024
Figure 1 for Evaluating LLMs at Detecting Errors in LLM Responses
Figure 2 for Evaluating LLMs at Detecting Errors in LLM Responses
Figure 3 for Evaluating LLMs at Detecting Errors in LLM Responses
Figure 4 for Evaluating LLMs at Detecting Errors in LLM Responses
Viaarxiv icon

Unified Low-Resource Sequence Labeling by Sample-Aware Dynamic Sparse Finetuning

Add code
Nov 07, 2023
Viaarxiv icon

Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications

Add code
Oct 11, 2023
Figure 1 for Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications
Figure 2 for Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications
Figure 3 for Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications
Figure 4 for Hermes: Unlocking Security Analysis of Cellular Network Protocols by Synthesizing Finite State Machines from Natural Language Specifications
Viaarxiv icon