Picture for Shinji Ito

Shinji Ito

Scale-Invariant Fast Convergence in Games

Add code
Feb 12, 2026
Viaarxiv icon

Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret

Add code
Feb 06, 2026
Viaarxiv icon

Fast EXP3 Algorithms

Add code
Dec 12, 2025
Figure 1 for Fast EXP3 Algorithms
Figure 2 for Fast EXP3 Algorithms
Figure 3 for Fast EXP3 Algorithms
Figure 4 for Fast EXP3 Algorithms
Viaarxiv icon

Reinforcement Learning from Adversarial Preferences in Tabular MDPs

Add code
Jul 15, 2025
Figure 1 for Reinforcement Learning from Adversarial Preferences in Tabular MDPs
Figure 2 for Reinforcement Learning from Adversarial Preferences in Tabular MDPs
Viaarxiv icon

Bandit Max-Min Fair Allocation

Add code
May 08, 2025
Viaarxiv icon

Optimal Regret of Bernoulli Bandits under Global Differential Privacy

Add code
May 08, 2025
Viaarxiv icon

Influential Bandits: Pulling an Arm May Change the Environment

Add code
Apr 11, 2025
Figure 1 for Influential Bandits: Pulling an Arm May Change the Environment
Figure 2 for Influential Bandits: Pulling an Arm May Change the Environment
Figure 3 for Influential Bandits: Pulling an Arm May Change the Environment
Figure 4 for Influential Bandits: Pulling an Arm May Change the Environment
Viaarxiv icon

Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback

Add code
Feb 24, 2025
Figure 1 for Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
Figure 2 for Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
Viaarxiv icon

Data-dependent Bounds with $T$-Optimal Best-of-Both-Worlds Guarantees in Multi-Armed Bandits using Stability-Penalty Matching

Add code
Feb 12, 2025
Figure 1 for Data-dependent Bounds with $T$-Optimal Best-of-Both-Worlds Guarantees in Multi-Armed Bandits using Stability-Penalty Matching
Viaarxiv icon

Corrupted Learning Dynamics in Games

Add code
Dec 10, 2024
Figure 1 for Corrupted Learning Dynamics in Games
Figure 2 for Corrupted Learning Dynamics in Games
Viaarxiv icon