Kaiyan Zhao

ANO: A Principled Approach to Robust Policy Optimization

May 06, 2026
Via arXiv

Anon: Extrapolating Adaptivity Beyond SGD and Adam

May 06, 2026
Via arXiv

C²T: Captioning-Structure and LLM-Aligned Common-Sense Reward Learning for Traffic-Vehicle Coordination

Apr 10, 2026
Via arXiv

Heuristic Self-Paced Learning for Domain Adaptive Semantic Segmentation under Adverse Conditions

Mar 25, 2026
Via arXiv

When Attention Betrays: Erasing Backdoor Attacks in Robotic Policies by Reconstructing Visual Tokens

Feb 03, 2026
Via arXiv

Decouple Searching from Training: Scaling Data Mixing via Model Merging for Large Language Model Pre-training

Jan 31, 2026
Via arXiv

Benchmarking Machine Translation on Chinese Social Media Texts

Jan 30, 2026
Via arXiv

NeoAMT: Neologism-Aware Agentic Machine Translation with Reinforcement Learning

Jan 07, 2026
Via arXiv

EComStage: Stage-wise and Orientation-specific Benchmarking for Large Language Models in E-commerce

Jan 06, 2026
Via arXiv

RGMP: Recurrent Geometric-prior Multimodal Policy for Generalizable Humanoid Robot Manipulation

Nov 12, 2025
Via arXiv