Picture for Md Arafat Sultan

Md Arafat Sultan

IBM Research AI, T.J. Watson Research Center, New York, USA

Confidence-Weighted Token Set Cover for Early Hypothesis Pruning in Self-Consistency

Add code
Aug 06, 2025
Viaarxiv icon

Optimal Policy Minimum Bayesian Risk

Add code
May 22, 2025
Viaarxiv icon

FIRST: Faster Improved Listwise Reranking with Single Token Decoding

Add code
Jun 21, 2024
Figure 1 for FIRST: Faster Improved Listwise Reranking with Single Token Decoding
Figure 2 for FIRST: Faster Improved Listwise Reranking with Single Token Decoding
Figure 3 for FIRST: Faster Improved Listwise Reranking with Single Token Decoding
Figure 4 for FIRST: Faster Improved Listwise Reranking with Single Token Decoding
Viaarxiv icon

Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

Add code
Jun 17, 2024
Figure 1 for Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels
Figure 2 for Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels
Figure 3 for Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels
Figure 4 for Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels
Viaarxiv icon

Self-Refinement of Language Models from External Proxy Metrics Feedback

Add code
Feb 27, 2024
Figure 1 for Self-Refinement of Language Models from External Proxy Metrics Feedback
Figure 2 for Self-Refinement of Language Models from External Proxy Metrics Feedback
Figure 3 for Self-Refinement of Language Models from External Proxy Metrics Feedback
Figure 4 for Self-Refinement of Language Models from External Proxy Metrics Feedback
Viaarxiv icon

Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations

Add code
Feb 20, 2024
Figure 1 for Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
Figure 2 for Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
Figure 3 for Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
Figure 4 for Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations
Viaarxiv icon

An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation

Add code
Jan 12, 2024
Figure 1 for An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation
Figure 2 for An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation
Figure 3 for An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation
Figure 4 for An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation
Viaarxiv icon

Multistage Collaborative Knowledge Distillation from Large Language Models

Add code
Nov 15, 2023
Figure 1 for Multistage Collaborative Knowledge Distillation from Large Language Models
Figure 2 for Multistage Collaborative Knowledge Distillation from Large Language Models
Figure 3 for Multistage Collaborative Knowledge Distillation from Large Language Models
Figure 4 for Multistage Collaborative Knowledge Distillation from Large Language Models
Viaarxiv icon

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Add code
Oct 21, 2023
Figure 1 for Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Figure 2 for Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Figure 3 for Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Figure 4 for Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs
Viaarxiv icon

Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval

Add code
May 19, 2023
Figure 1 for Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval
Figure 2 for Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval
Figure 3 for Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval
Figure 4 for Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval
Viaarxiv icon