Picture for Ye Shen

Ye Shen

EvolMem: A Cognitive-Driven Benchmark for Multi-Session Dialogue Memory

Add code
Jan 07, 2026
Viaarxiv icon

One Battle After Another: Probing LLMs' Limits on Multi-Turn Instruction Following with a Benchmark Evolving Framework

Add code
Nov 05, 2025
Viaarxiv icon

Efficiently Learning Synthetic Control Models for High-dimensional Disaggregated Data

Add code
Oct 26, 2025
Viaarxiv icon

A Multi-To-One Interview Paradigm for Efficient MLLM Evaluation

Add code
Sep 18, 2025
Viaarxiv icon

The Ever-Evolving Science Exam

Add code
Jul 22, 2025
Figure 1 for The Ever-Evolving Science Exam
Figure 2 for The Ever-Evolving Science Exam
Figure 3 for The Ever-Evolving Science Exam
Figure 4 for The Ever-Evolving Science Exam
Viaarxiv icon

When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs

Add code
Jun 25, 2025
Viaarxiv icon

Command A: An Enterprise-Ready Large Language Model

Add code
Apr 01, 2025
Figure 1 for Command A: An Enterprise-Ready Large Language Model
Figure 2 for Command A: An Enterprise-Ready Large Language Model
Figure 3 for Command A: An Enterprise-Ready Large Language Model
Figure 4 for Command A: An Enterprise-Ready Large Language Model
Viaarxiv icon

Large Language Models for Bioinformatics

Add code
Jan 10, 2025
Figure 1 for Large Language Models for Bioinformatics
Viaarxiv icon

PharmacyGPT: The AI Pharmacist

Add code
Jul 21, 2023
Viaarxiv icon

AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology

Add code
Jun 16, 2023
Figure 1 for AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology
Figure 2 for AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology
Figure 3 for AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology
Figure 4 for AD-AutoGPT: An Autonomous GPT for Alzheimer's Disease Infodemiology
Viaarxiv icon