Picture for Ruilin Yan

Ruilin Yan

System Design for Maintaining Internal State Consistency in Long-Horizon Robotic Tabletop Games

Add code
Mar 26, 2026
Viaarxiv icon

Generative Evaluation of Complex Reasoning in Large Language Models

Add code
Apr 03, 2025
Figure 1 for Generative Evaluation of Complex Reasoning in Large Language Models
Figure 2 for Generative Evaluation of Complex Reasoning in Large Language Models
Figure 3 for Generative Evaluation of Complex Reasoning in Large Language Models
Figure 4 for Generative Evaluation of Complex Reasoning in Large Language Models
Viaarxiv icon