Harshad Khadilkar

AEGIS: An Agent for Extraction and Geographic Identification in Scholarly Proceedings

Sep 11, 2025 (via arXiv)

$TAR^2$: Temporal-Agent Reward Redistribution for Optimal Policy Preservation in Multi-Agent Reinforcement Learning

Feb 07, 2025 (via arXiv)

Agent-Temporal Credit Assignment for Optimal Policy Preservation in Sparse Multi-Agent Reinforcement Learning

Dec 19, 2024 (via arXiv)

DeepClean: Integrated Distortion Identification and Algorithm Selection for Rectifying Image Corruptions

Jul 23, 2024 (via arXiv)

Leveraging Domain Knowledge for Efficient Reward Modelling in RLHF: A Case-Study in E-Commerce Opinion Summarization

Feb 23, 2024 (via arXiv)

Transformers are Expressive, But Are They Expressive Enough for Regression?

Feb 23, 2024 (via arXiv)

Reinforcement Replaces Supervision: Query focused Summarization using Deep Reinforcement Learning

Nov 29, 2023 (via arXiv)

Multi-Agent Learning of Efficient Fulfilment and Routing Strategies in E-Commerce

Nov 20, 2023 (via arXiv)

Using General Value Functions to Learn Domain-Backed Inventory Management Policies

Nov 03, 2023 (via arXiv)

Using Linear Regression for Iteratively Training Neural Networks

Jul 14, 2023 (via arXiv)