Picture for Zheng Yuan

Zheng Yuan

Istituto Italiano di Tecnologia, Italy, Università di Ferrara, Italy

Teacher Demonstrations in a BabyLM's Zone of Proximal Development for Contingent Multi-Turn Interaction

Add code
Oct 23, 2025
Viaarxiv icon

Probing Latent Knowledge Conflict for Faithful Retrieval-Augmented Generation

Add code
Oct 14, 2025
Viaarxiv icon

GenQuest: An LLM-based Text Adventure Game for Language Learners

Add code
Oct 06, 2025
Viaarxiv icon

Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment

Add code
Sep 19, 2025
Viaarxiv icon

You Don't Need Pre-built Graphs for RAG: Retrieval Augmented Generation with Adaptive Reasoning Structures

Add code
Aug 08, 2025
Viaarxiv icon

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Add code
Aug 01, 2025
Viaarxiv icon

RESAR-BEV: An Explainable Progressive Residual Autoregressive Approach for Camera-Radar Fusion in BEV Segmentation

Add code
May 10, 2025
Viaarxiv icon

FormalMATH: Benchmarking Formal Mathematical Reasoning of Large Language Models

Add code
May 05, 2025
Viaarxiv icon

Rubrik's Cube: Testing a New Rubric for Evaluating Explanations on the CUBE dataset

Add code
Mar 31, 2025
Viaarxiv icon

REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models

Add code
Mar 20, 2025
Figure 1 for REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Figure 2 for REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Figure 3 for REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Figure 4 for REVAL: A Comprehension Evaluation on Reliability and Values of Large Vision-Language Models
Viaarxiv icon