Picture for Mikoto Kudo

Mikoto Kudo

Sample-Efficient Hypergradient Estimation for Decentralized Bi-Level Reinforcement Learning

Add code
Mar 16, 2026
Viaarxiv icon

Cost-Minimized Label-Flipping Poisoning Attack to LLM Alignment

Add code
Nov 12, 2025
Viaarxiv icon

Policy Iteration for Pareto-Optimal Policies in Stochastic Stackelberg Games

Add code
May 07, 2024
Viaarxiv icon