Yijin Guo

EvolMem: A Cognitive-Driven Benchmark for Multi-Session Dialogue Memory

Jan 07, 2026

A Multi-To-One Interview Paradigm for Efficient MLLM Evaluation

Sep 18, 2025

The Ever-Evolving Science Exam

Jul 22, 2025