Picture for Vatsal Gupta

Vatsal Gupta

Prose2Policy (P2P): A Practical LLM Pipeline for Translating Natural-Language Access Policies into Executable Rego

Add code
Mar 16, 2026
Viaarxiv icon

NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models

Add code
Jul 15, 2024
Figure 1 for NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models
Figure 2 for NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models
Figure 3 for NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models
Figure 4 for NTSEBENCH: Cognitive Reasoning Benchmark for Vision Language Models
Viaarxiv icon

FlowVQA: Mapping Multimodal Logic in Visual Question Answering with Flowcharts

Add code
Jun 27, 2024
Viaarxiv icon

ChIRAAG: ChatGPT Informed Rapid and Automated Assertion Generation

Add code
Jan 31, 2024
Figure 1 for ChIRAAG: ChatGPT Informed Rapid and Automated Assertion Generation
Figure 2 for ChIRAAG: ChatGPT Informed Rapid and Automated Assertion Generation
Figure 3 for ChIRAAG: ChatGPT Informed Rapid and Automated Assertion Generation
Figure 4 for ChIRAAG: ChatGPT Informed Rapid and Automated Assertion Generation
Viaarxiv icon

Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets

Add code
Nov 15, 2023
Figure 1 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 2 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 3 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Figure 4 for Multi-Set Inoculation: Assessing Model Robustness Across Multiple Challenge Sets
Viaarxiv icon