Md Arafat Sultan

IBM Research AI, T.J. Watson Research Center, New York, USA

FIRST: Faster Improved Listwise Reranking with Single Token Decoding

Jun 21, 2024
Via arXiv

Prompts as Auto-Optimized Training Hyperparameters: Training Best-in-Class IR Models from Scratch with 10 Gold Labels

Jun 17, 2024
Via arXiv

Self-Refinement of Language Models from External Proxy Metrics Feedback

Feb 27, 2024
Via arXiv

Structured Chain-of-Thought Prompting for Few-Shot Generation of Content-Grounded QA Conversations

Feb 20, 2024
Via arXiv

An Empirical Investigation into the Effect of Parameter Choices in Knowledge Distillation

Jan 12, 2024
Via arXiv

Multistage Collaborative Knowledge Distillation from Large Language Models

Nov 15, 2023
Via arXiv

Ensemble-Instruct: Generating Instruction-Tuning Data with a Heterogeneous Mixture of LMs

Oct 21, 2023
Via arXiv

Inference-time Re-ranker Relevance Feedback for Neural Information Retrieval

May 19, 2023
Via arXiv

UDAPDR: Unsupervised Domain Adaptation via LLM Prompting and Distillation of Rerankers

Mar 01, 2023
Via arXiv

Knowledge Distillation $\approx$ Label Smoothing: Fact or Fallacy?

Feb 06, 2023
Via arXiv