Picture for Luyu Chen

Luyu Chen

Beyond Single-Point Judgment: Distribution Alignment for LLM-as-a-Judge

Add code
May 18, 2025
Viaarxiv icon

MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants

Add code
Sep 30, 2024
Figure 1 for MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Figure 2 for MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Figure 3 for MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Figure 4 for MemSim: A Bayesian Simulator for Evaluating Memory of LLM-based Personal Assistants
Viaarxiv icon