Picture for Yesheng Liu

Yesheng Liu

Do Vision-Language Models Measure Up? Benchmarking Visual Measurement Reading with MeasureBench

Add code
Oct 30, 2025
Viaarxiv icon

FlagEvalMM: A Flexible Framework for Comprehensive Multimodal Model Evaluation

Add code
Jun 10, 2025
Viaarxiv icon