Prafulla Kumar Choubey

CRMArena-Pro: Holistic Assessment of LLM Agents Across Diverse Business Scenarios and Interactions

May 24, 2025
Via arXiv

Turning Conversations into Workflows: A Framework to Extract and Evaluate Dialog Workflows for Service AI Agents

Feb 24, 2025
Via arXiv

Unanswerability Evaluation for Retrieval Augmented Generation

Dec 16, 2024
Via arXiv

SiReRAG: Indexing Similar and Related Information for Multihop Reasoning

Dec 09, 2024
Via arXiv

Distill-SynthKG: Distilling Knowledge Graph Synthesis Workflow for Improved Coverage and Efficiency

Oct 22, 2024
Via arXiv

Do RAG Systems Cover What Matters? Evaluating and Optimizing Responses with Sub-Question Coverage

Oct 20, 2024
Via arXiv

Lexical Repetitions Lead to Rote Learning: Unveiling the Impact of Lexical Overlap in Train and Test Reference Summaries

Nov 15, 2023
Via arXiv

Embrace Divergence for Richer Insights: A Multi-document Summarization Benchmark and a Case Study on Summarizing Diverse Information from News Articles

Sep 17, 2023
Via arXiv

XGen-7B Technical Report

Sep 07, 2023
Via arXiv

Improving Factual Consistency in Summarization with Compression-Based Post-Editing

Nov 11, 2022
Via arXiv