Picture for Sathish A. P. Kumar

Sathish A. P. Kumar

ACE-RLHF: Automated Code Evaluation and Socratic Feedback Generation Tool using Large Language Models and Reinforcement Learning with Human Feedback

Add code
Apr 07, 2025
Viaarxiv icon

Enhanced Penalty-based Bidirectional Reinforcement Learning Algorithms

Add code
Apr 04, 2025
Viaarxiv icon

MORAL: A Multimodal Reinforcement Learning Framework for Decision Making in Autonomous Laboratories

Add code
Apr 04, 2025
Viaarxiv icon

GPA: Grover Policy Agent for Generating Optimal Quantum Sensor Circuits

Add code
Feb 19, 2025
Viaarxiv icon