Picture for Junxiu Zhou

Junxiu Zhou

AutoFormBench: Benchmark Dataset for Automating Form Understanding

Add code
Mar 31, 2026
Viaarxiv icon

Beyond pass@1: A Reliability Science Framework for Long-Horizon LLM Agents

Add code
Mar 31, 2026
Viaarxiv icon

Severe Domain Shift in Skeleton-Based Action Recognition:A Study of Uncertainty Failure in Real-World Gym Environments

Add code
Mar 16, 2026
Viaarxiv icon