Picture for Marius-Constantin Dinu

Marius-Constantin Dinu

Large Language Models Can Self-Improve At Web Agent Tasks

Add code
May 30, 2024
Figure 1 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 2 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 3 for Large Language Models Can Self-Improve At Web Agent Tasks
Figure 4 for Large Language Models Can Self-Improve At Web Agent Tasks
Viaarxiv icon

SymbolicAI: A framework for logic-based approaches combining generative models and solvers

Add code
Feb 05, 2024
Viaarxiv icon

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation

Add code
May 02, 2023
Figure 1 for Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation
Figure 2 for Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation
Figure 3 for Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation
Figure 4 for Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation
Viaarxiv icon

Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning

Add code
Jul 12, 2022
Figure 1 for Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
Figure 2 for Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
Figure 3 for Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
Figure 4 for Reactive Exploration to Cope with Non-Stationarity in Lifelong Reinforcement Learning
Viaarxiv icon

Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning

Add code
Nov 08, 2021
Figure 1 for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Figure 2 for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Figure 3 for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Figure 4 for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
Viaarxiv icon

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution

Add code
Sep 29, 2020
Figure 1 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 2 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 3 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Figure 4 for Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution
Viaarxiv icon