Picture for Taira Tsuchiya

Taira Tsuchiya

Scale-Invariant Fast Convergence in Games

Add code
Feb 12, 2026
Viaarxiv icon

Adversarial Learning in Games with Bandit Feedback: Logarithmic Pure-Strategy Maximin Regret

Add code
Feb 06, 2026
Viaarxiv icon

Data- and Variance-dependent Regret Bounds for Online Tabular MDPs

Add code
Feb 02, 2026
Viaarxiv icon

Reinforcement Learning from Adversarial Preferences in Tabular MDPs

Add code
Jul 15, 2025
Figure 1 for Reinforcement Learning from Adversarial Preferences in Tabular MDPs
Figure 2 for Reinforcement Learning from Adversarial Preferences in Tabular MDPs
Viaarxiv icon

Bandit and Delayed Feedback in Online Structured Prediction

Add code
Feb 26, 2025
Viaarxiv icon

Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback

Add code
Feb 24, 2025
Figure 1 for Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
Figure 2 for Instance-Dependent Regret Bounds for Learning Two-Player Zero-Sum Games with Bandit Feedback
Viaarxiv icon

Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis

Add code
Jan 27, 2025
Figure 1 for Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis
Figure 2 for Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis
Figure 3 for Online Inverse Linear Optimization: Improved Regret Bound, Robustness to Suboptimality, and Toward Tight Regret Analysis
Viaarxiv icon

Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel$-$Young Loss Perspective and Gap-Dependent Regret Analysis

Add code
Jan 24, 2025
Figure 1 for Revisiting Online Learning Approach to Inverse Linear Optimization: A Fenchel$-$Young Loss Perspective and Gap-Dependent Regret Analysis
Viaarxiv icon

Corrupted Learning Dynamics in Games

Add code
Dec 10, 2024
Figure 1 for Corrupted Learning Dynamics in Games
Figure 2 for Corrupted Learning Dynamics in Games
Viaarxiv icon

A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $Θ$ and its Application to Best-of-Both-Worlds

Add code
May 30, 2024
Figure 1 for A Simple and Adaptive Learning Rate for FTRL in Online Learning with Minimax Regret of $Θ$ and its Application to Best-of-Both-Worlds
Viaarxiv icon