Picture for Yiding Ma

Yiding Ma

OracleProto: A Reproducible Framework for Benchmarking LLM Native Forecasting via Knowledge Cutoff and Temporal Masking

Add code
May 05, 2026
Viaarxiv icon

WorldArena: A Unified Benchmark for Evaluating Perception and Functional Utility of Embodied World Models

Add code
Feb 09, 2026
Viaarxiv icon