Dhruv Kumar

Beyond Accuracy: Diagnosing Algebraic Reasoning Failures in LLMs Across Nine Complexity Dimensions

Apr 08, 2026
Via arXiv

LUDOBENCH: Evaluating LLM Behavioural Decision-Making Through Spot-Based Board Game Scenarios in Ludo

Apr 07, 2026
Via arXiv

LLM-as-a-Judge for Time Series Explanations

Apr 02, 2026
Via arXiv

Infinite Problem Generator: Verifiably Scaling Physics Reasoning Data with Agentic Workflows

Mar 15, 2026
Via arXiv

Trust Regions Sell, But Who's Buying? Overlap Geometry as an Alternative Trust Region for Policy Optimization

Feb 06, 2026
Via arXiv

The Compliance Paradox: Semantic-Instruction Decoupling in Automated Academic Code Evaluation

Jan 29, 2026
Via arXiv

A Hybrid Supervised-LLM Pipeline for Actionable Suggestion Mining in Unstructured Customer Reviews

Jan 27, 2026
Via arXiv

PhysicsSolutionAgent: Towards Multimodal Explanations for Numerical Physics Problem Solving

Jan 19, 2026
Via arXiv

Actionable Advice from Reviews via Mixture of LoRA Experts: A Two-LLM Pipeline for Issue Extraction and Business Recommendations

Jan 18, 2026
Via arXiv

A Multi-Agent System for Generating Actionable Business Advice

Jan 17, 2026
Via arXiv