Picture for Xinda Zhao

Xinda Zhao

EverMemBench: Benchmarking Long-Term Interactive Memory in Large Language Models

Add code
Feb 03, 2026
Viaarxiv icon

Beyond the Needle's Illusion: Decoupled Evaluation of Evidence Access and Use under Semantic Interference at 326M-Token Scale

Add code
Jan 28, 2026
Viaarxiv icon