Picture for Ciamac C. Moallemi

Ciamac C. Moallemi

Outbidding and Outbluffing Elite Humans: Mastering Liar's Poker via Self-Play and Reinforcement Learning

Add code
Nov 05, 2025
Viaarxiv icon

Optimal Dynamic Fees for Blockchain Resources

Add code
Sep 22, 2023
Figure 1 for Optimal Dynamic Fees for Blockchain Resources
Figure 2 for Optimal Dynamic Fees for Blockchain Resources
Figure 3 for Optimal Dynamic Fees for Blockchain Resources
Figure 4 for Optimal Dynamic Fees for Blockchain Resources
Viaarxiv icon

Policy Gradient Optimization of Thompson Sampling Policies

Add code
Jun 30, 2020
Figure 1 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 2 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 3 for Policy Gradient Optimization of Thompson Sampling Policies
Figure 4 for Policy Gradient Optimization of Thompson Sampling Policies
Viaarxiv icon

Thompson Sampling with Information Relaxation Penalties

Add code
Feb 12, 2019
Figure 1 for Thompson Sampling with Information Relaxation Penalties
Figure 2 for Thompson Sampling with Information Relaxation Penalties
Figure 3 for Thompson Sampling with Information Relaxation Penalties
Figure 4 for Thompson Sampling with Information Relaxation Penalties
Viaarxiv icon

Universal Reinforcement Learning

Add code
Jul 22, 2009
Figure 1 for Universal Reinforcement Learning
Viaarxiv icon

Convergence of Min-Sum Message Passing for Quadratic Optimization

Add code
Dec 24, 2008
Viaarxiv icon

Consensus Propagation

Add code
May 29, 2007
Figure 1 for Consensus Propagation
Viaarxiv icon