Keerthiram Murugesan

CTBench: A Comprehensive Benchmark for Evaluating Language Model Capabilities in Clinical Trial Design

Jun 25, 2024
Via arXiv

STARLING: Self-supervised Training of Text-based Reinforcement Learning Agent with Large Language Models

Jun 09, 2024
Via arXiv

Facilitating Human-LLM Collaboration through Factuality Scores and Source Attributions

May 30, 2024
Via arXiv

SF-DQN: Provable Knowledge Transfer using Successor Feature for Deep Reinforcement Learning

May 24, 2024
Via arXiv

On the Effects of Fine-tuning Language Models for Text-Based Reinforcement Learning

Apr 15, 2024
Via arXiv

EXPLORER: Exploration-guided Reasoning for Textual Reinforcement Learning

Mar 15, 2024
Via arXiv

Detectors for Safe and Reliable LLMs: Implementations, Uses, and Limitations

Mar 09, 2024
Via arXiv

Language Guided Exploration for RL Agents in Text Environments

Mar 05, 2024
Via arXiv

On the Prospects of Incorporating Large Language Models (LLMs) in Automated Planning and Scheduling (APS)

Jan 04, 2024
Via arXiv

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $ε$-Greedy Exploration

Oct 24, 2023
Via arXiv