Get our free extension to see links to code for papers anywhere online!

 Add to Chrome

 Add to Firefox

CatalyzeX Code Finder - Browser extension linking code for ML papers across the web! | Product Hunt Embed
Etat de l'art sur l'application des bandits multi-bras

Jan 04, 2021
Djallel Bouneffouf

* in French 

  Access Paper or Ask Questions

Predicting Human Decision Making in Psychological Tasks with Recurrent Neural Networks

Oct 22, 2020
Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi


  Access Paper or Ask Questions

Learned Fine-Tuner for Incongruous Few-Shot Learning

Oct 20, 2020
Pu Zhao, Sijia Liu, Parikshit Ram, Songtao Lu, Djallel Bouneffouf, Xue Lin


  Access Paper or Ask Questions

Double-Linear Thompson Sampling for Context-Attentive Bandits

Oct 15, 2020
Djallel Bouneffouf, Raphaël Féraud, Sohini Upadhyay, Yasaman Khazaeni, Irina Rish

* arXiv admin note: text overlap with arXiv:1906.09384 

  Access Paper or Ask Questions

Spectral Clustering using Eigenspectrum Shape Based Nystrom Sampling

Jul 21, 2020
Djallel Bouneffouf


  Access Paper or Ask Questions

Contextual Bandit with Missing Rewards

Jul 19, 2020
Djallel Bouneffouf, Sohini Upadhyay, Yasaman Khazaeni


  Access Paper or Ask Questions

Computing the Dirichlet-Multinomial Log-Likelihood Function

Jul 17, 2020
Djallel Bouneffouf


  Access Paper or Ask Questions

Solving Constrained CASH Problems with ADMM

Jul 11, 2020
Parikshit Ram, Sijia Liu, Deepak Vijaykeerthi, Dakuo Wang, Djallel Bouneffouf, Greg Bramble, Horst Samulowitz, Alexander G. Gray

* 7th ICML Workshop on Automated Machine Learning (2020) 

  Access Paper or Ask Questions

Online learning with Corrupted context: Corrupted Contextual Bandits

Jun 26, 2020
Djallel Bouneffouf


  Access Paper or Ask Questions

Online Learning in Iterated Prisoner's Dilemma to Mimic Human Behavior

Jun 09, 2020
Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

* To the best of our knowledge, this is the first attempt to explore the full spectrum of reinforcement learning agents (multi-armed bandits, contextual bandits and reinforcement learning) in the sequential social dilemma. This mental variants section supersedes and extends our work arXiv:1706.02897 (MAB), arXiv:2005.04544 (CB) and arXiv:1906.11286 (RL) into the multi-agent setting 

  Access Paper or Ask Questions

Unified Models of Human Behavioral Agents in Bandits, Contextual Bandits and RL

May 12, 2020
Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

* This article supersedes and extends our work arXiv:1706.02897 (MAB) and arXiv:1906.11286 (RL) into the Contextual Bandit (CB) framework. It generalized extensively into multi-armed bandits, contextual bandits and RL settings to create a unified framework of human behavioral agents 

  Access Paper or Ask Questions

Hyper-parameter Tuning for the Contextual Bandit

May 04, 2020
Djallel Bouneffouf, Emmanuelle Claeys

* arXiv admin note: text overlap with arXiv:1705.03821 

  Access Paper or Ask Questions

How can AI Automate End-to-End Data Science?

Oct 22, 2019
Charu Aggarwal, Djallel Bouneffouf, Horst Samulowitz, Beat Buesser, Thanh Hoang, Udayan Khurana, Sijia Liu, Tejaswini Pedapati, Parikshit Ram, Ambrish Rawat, Martin Wistuba, Alexander Gray


  Access Paper or Ask Questions

Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders

Jun 28, 2019
Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Jenna Reinen, Irina Rish

* arXiv admin note: substantial text overlap with arXiv:1706.02897 

  Access Paper or Ask Questions

Split Q Learning: Reinforcement Learning with Two-Stream Rewards

Jun 21, 2019
Baihan Lin, Djallel Bouneffouf, Guillermo Cecchi

* IJCAI 2019. arXiv admin note: substantial text overlap with arXiv:1706.02897 and arXiv:1906.11286 

  Access Paper or Ask Questions

Optimal Exploitation of Clustering and History Information in Multi-Armed Bandit

May 31, 2019
Djallel Bouneffouf, Srinivasan Parthasarathy, Horst Samulowitz, Martin Wistub

* IJCAI 2019, International Joint Conferences on Artificial Intelligence 

  Access Paper or Ask Questions

Automated Machine Learning via ADMM

May 01, 2019
Sijia Liu, Parikshit Ram, Djallel Bouneffouf, Gregory Bramble, Andrew R Conn, Horst Samulowitz, Alexander Gray


  Access Paper or Ask Questions

A Survey on Practical Applications of Multi-Armed and Contextual Bandits

Apr 02, 2019
Djallel Bouneffouf, Irina Rish

* under review by IJCAI 2019 Survey 

  Access Paper or Ask Questions

Beyond Backprop: Online Alternating Minimization with Auxiliary Variables

Oct 24, 2018
Anna Choromanska, Sadhana Kumaravel, Ronny Luss, Irina Rish, Brian Kingsbury, Mattia Rigotti, Paolo DiAchille, Viatcheslav Gurev, Ravi Tejwani, Djallel Bouneffouf

* First four authors contributed equally to this work: A.C. - theory, manuscript, S.K. - code, experiments, R.L. - algorithm, experiments, I.R. - algorithm, manuscript 

  Access Paper or Ask Questions

Interpretable Multi-Objective Reinforcement Learning through Policy Orchestration

Sep 21, 2018
Ritesh Noothigattu, Djallel Bouneffouf, Nicholas Mattei, Rachita Chandra, Piyush Madan, Kush Varshney, Murray Campbell, Moninder Singh, Francesca Rossi

* 8 pages, 3 figures 

  Access Paper or Ask Questions

Incorporating Behavioral Constraints in Online AI Systems

Sep 15, 2018
Avinash Balakrishnan, Djallel Bouneffouf, Nicholas Mattei, Francesca Rossi

* 9 pages, 6 figures 

  Access Paper or Ask Questions

Adaptive Representation Selection in Contextual Bandit

May 15, 2018
Baihan Lin, Guillermo Cecchi, Djallel Bouneffouf, Irina Rish


  Access Paper or Ask Questions

Scalable Recollections for Continual Lifelong Learning

Feb 26, 2018
Matthew Riemer, Tim Klinger, Michele Franceschini, Djallel Bouneffouf


  Access Paper or Ask Questions

Context Attentive Bandits: Contextual Bandit with Restricted Context

Jun 07, 2017
Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi, Raphael Feraud

* IJCAI 2017 

  Access Paper or Ask Questions

Bandit Models of Human Behavior: Reward Processing in Mental Disorders

Jun 07, 2017
Djallel Bouneffouf, Irina Rish, Guillermo A. Cecchi

* Conference on Artificial General Intelligence, AGI-17 

  Access Paper or Ask Questions

Multi-armed Bandit Problem with Known Trend

May 10, 2017
Djallel Bouneffouf, Raphaël Feraud

* Neurocomputing 2016. arXiv admin note: text overlap with arXiv:0805.3415 by other authors 

  Access Paper or Ask Questions

Freshness-Aware Thompson Sampling

Sep 29, 2014
Djallel Bouneffouf

* 21st International Conference on Neural Information Processing. arXiv admin note: text overlap with arXiv:1409.7729 

  Access Paper or Ask Questions