Lingyu Li

A Rigorous Benchmark with Multidimensional Evaluation for Deep Research Agents: From Answers to Reports

Oct 02, 2025
Via arXiv

SafeWork-R1: Coevolving Safety and Intelligence under the AI-45$^{\circ}$ Law

Jul 24, 2025
Via arXiv

Integrating emotional intelligence, memory architecture, and gestures to achieve empathetic humanoid robot interaction in an educational setting

May 26, 2025
Via arXiv

Reflection-Bench: probing AI intelligence with reflection

Oct 21, 2024
Via arXiv

ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models

Jun 24, 2024
Via arXiv