Jinming Nian

Submodular Evaluation Subset Selection in Automatic Prompt Optimization

Jan 07, 2026

Evaluating Social Biases in LLM Reasoning

Feb 21, 2025

RAG-ConfusionQA: A Benchmark for Evaluating LLMs on Confusing Questions

Oct 18, 2024

W-RAG: Weakly Supervised Dense Retrieval in RAG for Open-domain Question Answering

Aug 15, 2024