Picture for Samarendra Chandan Bindu Dash

Samarendra Chandan Bindu Dash

Evaluating LLM Reasoning in the Operations Research Domain with ORQA

Add code
Dec 22, 2024
Figure 1 for Evaluating LLM Reasoning in the Operations Research Domain with ORQA
Figure 2 for Evaluating LLM Reasoning in the Operations Research Domain with ORQA
Figure 3 for Evaluating LLM Reasoning in the Operations Research Domain with ORQA
Figure 4 for Evaluating LLM Reasoning in the Operations Research Domain with ORQA
Viaarxiv icon

DADAgger: Disagreement-Augmented Dataset Aggregation

Add code
Jan 03, 2023
Viaarxiv icon