Picture for Mrinmaya Sachan

Mrinmaya Sachan

From Problem-Solving to Teaching Problem-Solving: Aligning LLMs with Pedagogy using Reinforcement Learning

Add code
May 21, 2025
Viaarxiv icon

LEXam: Benchmarking Legal Reasoning on 340 Law Exams

Add code
May 19, 2025
Figure 1 for LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Figure 2 for LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Figure 3 for LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Figure 4 for LEXam: Benchmarking Legal Reasoning on 340 Law Exams
Viaarxiv icon

A Head to Predict and a Head to Question: Pre-trained Uncertainty Quantification Heads for Hallucination Detection in LLM Outputs

Add code
May 13, 2025
Viaarxiv icon

Multilingual Performance Biases of Large Language Models in Education

Add code
Apr 24, 2025
Viaarxiv icon

MathTutorBench: A Benchmark for Measuring Open-ended Pedagogical Capabilities of LLM Tutors

Add code
Feb 26, 2025
Viaarxiv icon

Navigating the Helpfulness-Truthfulness Trade-Off with Uncertainty-Aware Instruction Fine-Tuning

Add code
Feb 17, 2025
Viaarxiv icon

Grammar Control in Dialogue Response Generation for Language Learning Chatbots

Add code
Feb 11, 2025
Viaarxiv icon

Investigating the Zone of Proximal Development of Language Models for In-Context Learning

Add code
Feb 10, 2025
Viaarxiv icon

How to Select Datapoints for Efficient Human Evaluation of NLG Models?

Add code
Jan 30, 2025
Figure 1 for How to Select Datapoints for Efficient Human Evaluation of NLG Models?
Figure 2 for How to Select Datapoints for Efficient Human Evaluation of NLG Models?
Figure 3 for How to Select Datapoints for Efficient Human Evaluation of NLG Models?
Figure 4 for How to Select Datapoints for Efficient Human Evaluation of NLG Models?
Viaarxiv icon

Likelihood as a Performance Gauge for Retrieval-Augmented Generation

Add code
Nov 12, 2024
Viaarxiv icon