Picture for Adith Swaminathan

Adith Swaminathan

Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows

Add code
Jun 23, 2024
Viaarxiv icon

On Overcoming Miscalibrated Conversational Priors in LLM-based Chatbots

Add code
Jun 01, 2024
Viaarxiv icon

The Importance of Directional Feedback for LLM-based Optimizers

Add code
May 26, 2024
Figure 1 for The Importance of Directional Feedback for LLM-based Optimizers
Figure 2 for The Importance of Directional Feedback for LLM-based Optimizers
Figure 3 for The Importance of Directional Feedback for LLM-based Optimizers
Figure 4 for The Importance of Directional Feedback for LLM-based Optimizers
Viaarxiv icon

AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks

Add code
Mar 02, 2024
Figure 1 for AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Figure 2 for AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Figure 3 for AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Figure 4 for AutoAttacker: A Large Language Model Guided System to Implement Automatic Cyber-attacks
Viaarxiv icon

LLF-Bench: Benchmark for Interactive Learning from Language Feedback

Add code
Dec 13, 2023
Figure 1 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 2 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 3 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Figure 4 for LLF-Bench: Benchmark for Interactive Learning from Language Feedback
Viaarxiv icon

Interactive Robot Learning from Verbal Correction

Add code
Oct 26, 2023
Figure 1 for Interactive Robot Learning from Verbal Correction
Figure 2 for Interactive Robot Learning from Verbal Correction
Figure 3 for Interactive Robot Learning from Verbal Correction
Figure 4 for Interactive Robot Learning from Verbal Correction
Viaarxiv icon

Hindsight Learning for MDPs with Exogenous Inputs

Add code
Jul 13, 2022
Figure 1 for Hindsight Learning for MDPs with Exogenous Inputs
Figure 2 for Hindsight Learning for MDPs with Exogenous Inputs
Figure 3 for Hindsight Learning for MDPs with Exogenous Inputs
Figure 4 for Hindsight Learning for MDPs with Exogenous Inputs
Viaarxiv icon

Heuristic-Guided Reinforcement Learning

Add code
Jun 05, 2021
Figure 1 for Heuristic-Guided Reinforcement Learning
Figure 2 for Heuristic-Guided Reinforcement Learning
Figure 3 for Heuristic-Guided Reinforcement Learning
Figure 4 for Heuristic-Guided Reinforcement Learning
Viaarxiv icon

Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL

Add code
Jun 01, 2021
Figure 1 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 2 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 3 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Figure 4 for Improving Long-Term Metrics in Recommendation Systems using Short-Horizon Offline RL
Viaarxiv icon

Provably Good Batch Reinforcement Learning Without Great Exploration

Add code
Jul 22, 2020
Figure 1 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 2 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 3 for Provably Good Batch Reinforcement Learning Without Great Exploration
Figure 4 for Provably Good Batch Reinforcement Learning Without Great Exploration
Viaarxiv icon