Anirudh Satheesh

Regret Analysis of Unichain Average Reward Constrained MDPs with General Parameterization

Feb 08, 2026

Distributionally Robust Self Paced Curriculum Reinforcement Learning

Nov 12, 2025

Primal-Only Actor Critic Algorithm for Robust Constrained Average Cost MDPs

Nov 07, 2025

cMALC-D: Contextual Multi-Agent LLM-Guided Curriculum Learning with Diversity-Based Context Blending

Aug 28, 2025

PICore: Physics-Informed Unsupervised Coreset Selection for Data Efficient Neural Operator Training

Jul 23, 2025

MORSE-500: A Programmatically Controllable Video Benchmark to Stress-Test Multimodal Reasoning

Jun 05, 2025

EnsemW2S: Enhancing Weak-to-Strong Generalization with Large Language Model Ensembles

May 28, 2025

A Constrained Multi-Agent Reinforcement Learning Approach to Autonomous Traffic Signal Control

Mar 30, 2025

EnsemW2S: Can an Ensemble of LLMs be Leveraged to Obtain a Stronger LLM?

Oct 06, 2024

SAFLEX: Self-Adaptive Augmentation via Feature Label Extrapolation

Oct 03, 2024