Andrew Cohen

The ART of LLM Refinement: Ask, Refine, and Trust

Nov 14, 2023
Via arXiv

End-to-end Story Plot Generator

Oct 13, 2023
Via arXiv

Learning Personalized Story Evaluation

Oct 10, 2023
Via arXiv

Making PPO even better: Value-Guided Monte-Carlo Tree Search decoding

Sep 26, 2023
Via arXiv

Modeling Scattering Coefficients using Self-Attentive Complex Polynomials with Image-based Representation

Jan 10, 2023
Via arXiv

Biomedical image analysis competitions: The state of current participation practice

Dec 16, 2022
Via arXiv

Transfer RL across Observation Feature Spaces via Model-Based Regularization

Jan 01, 2022
Via arXiv

On the Use and Misuse of Absorbing States in Multi-agent Reinforcement Learning

Nov 10, 2021
Via arXiv

Perfecting the Crime Machine

Jan 14, 2020
Via arXiv

Maximum Entropy Diverse Exploration: Disentangling Maximum Entropy Reinforcement Learning

Nov 03, 2019
Via arXiv