Picture for Kaoyan Lu

Kaoyan Lu

ERGeoBench:A Comprehensive Benchmark for Embodied Reasoning and Geo-localization in Multimodal Large Language Models

Add code
May 29, 2026
Viaarxiv icon

CreBench: Human-Aligned Creativity Evaluation from Idea to Process to Product

Add code
Nov 17, 2025
Viaarxiv icon