Anton Dereventsov

Data-Centric Approach to Constrained Machine Learning: A Case Study on Conway's Game of Life

Aug 23, 2024
Via arXiv

An Empirical Categorization of Prompting Techniques for Large Language Models: A Practitioner's Guide

Feb 18, 2024
Via arXiv

Increasing Entropy to Boost Policy Gradient Performance on Personalization Tasks

Oct 09, 2023
Via arXiv

Zero-Shot Recommendations with Pre-Trained Large Language Models for Multimodal Nudging

Sep 02, 2023
Via arXiv

Modeling Non-deterministic Human Behaviors in Discrete Food Choices

Jan 23, 2023
Via arXiv

Examining Policy Entropy of Reinforcement Learning Agents for Personalization Tasks

Nov 21, 2022
Via arXiv

Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets

Oct 12, 2022
Via arXiv

On the Unreasonable Efficiency of State Space Clustering in Personalization Tasks

Dec 24, 2021
Via arXiv

Offline Policy Comparison under Limited Historical Agent-Environment Interactions

Jun 07, 2021
Via arXiv

An adaptive stochastic gradient-free approach for high-dimensional blackbox optimization

Jun 18, 2020
Via arXiv