Picture for Djallel Bouneffouf

Djallel Bouneffouf

Computing the Dirichlet-Multinomial Log-Likelihood Function

Add code
Jul 17, 2020
Figure 1 for Computing the Dirichlet-Multinomial Log-Likelihood Function
Figure 2 for Computing the Dirichlet-Multinomial Log-Likelihood Function
Viaarxiv icon

Solving Constrained CASH Problems with ADMM

Add code
Jul 11, 2020
Figure 1 for Solving Constrained CASH Problems with ADMM
Figure 2 for Solving Constrained CASH Problems with ADMM
Figure 3 for Solving Constrained CASH Problems with ADMM
Figure 4 for Solving Constrained CASH Problems with ADMM
Viaarxiv icon

Online learning with Corrupted context: Corrupted Contextual Bandits

Add code
Jun 26, 2020
Figure 1 for Online learning with Corrupted context: Corrupted Contextual Bandits
Figure 2 for Online learning with Corrupted context: Corrupted Contextual Bandits
Figure 3 for Online learning with Corrupted context: Corrupted Contextual Bandits
Viaarxiv icon

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

Add code
Jun 09, 2020
Figure 1 for Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Figure 2 for Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Figure 3 for Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Figure 4 for Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior
Viaarxiv icon

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

Add code
May 12, 2020
Figure 1 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Figure 2 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Figure 3 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Figure 4 for Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL
Viaarxiv icon

Hyper-parameter Tuning for the Contextual Bandit

Add code
May 04, 2020
Figure 1 for Hyper-parameter Tuning for the Contextual Bandit
Figure 2 for Hyper-parameter Tuning for the Contextual Bandit
Viaarxiv icon

How can AI Automate End-to-End Data Science?

Add code
Oct 22, 2019
Figure 1 for How can AI Automate End-to-End Data Science?
Viaarxiv icon

Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Add code
Jun 28, 2019
Figure 1 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders
Figure 2 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders
Figure 3 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders
Figure 4 for Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders
Viaarxiv icon

Split Q Learning: Reinforcement Learning with Two-Stream Rewards

Add code
Jun 21, 2019
Figure 1 for Split Q Learning: Reinforcement Learning with Two-Stream Rewards
Viaarxiv icon

Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit

Add code
May 31, 2019
Figure 1 for Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit
Viaarxiv icon