Picture for Shuze Daniel Liu

Shuze Daniel Liu

Convergence of Two-Timescale Markovian Stochastic Approximations with Applications in Reinforcement Learning

Add code
May 29, 2026
Viaarxiv icon

Offline Two-Player Zero-Sum Markov Games with KL Regularization

Add code
May 13, 2026
Viaarxiv icon

AstroAlertBench: Evaluating the Accuracy, Reasoning, and Honesty of Multimodal LLMs in Astronomical Classification

Add code
May 07, 2026
Viaarxiv icon

Instructing LLMs to Negotiate using Reinforcement Learning with Verifiable Rewards

Add code
Apr 10, 2026
Viaarxiv icon

MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics

Add code
Jan 30, 2026
Viaarxiv icon