Picture for Jieyu Zhao

Jieyu Zhao

Video-Based Reward Modeling for Computer-Use Agents

Add code
Mar 10, 2026
Viaarxiv icon

MED-COPILOT: A Medical Assistant Powered by GraphRAG and Similar Patient Case Retrieval

Add code
Feb 28, 2026
Viaarxiv icon

Experiential Reinforcement Learning

Add code
Feb 15, 2026
Viaarxiv icon

CoAct-1: Computer-using Agents with Coding as Actions

Add code
Aug 05, 2025
Figure 1 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 2 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 3 for CoAct-1: Computer-using Agents with Coding as Actions
Figure 4 for CoAct-1: Computer-using Agents with Coding as Actions
Viaarxiv icon

Can LLMs Express Personality Across Cultures? Introducing CulturalPersonas for Evaluating Trait Alignment

Add code
Jun 06, 2025
Viaarxiv icon

SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models

Add code
May 29, 2025
Figure 1 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 2 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 3 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Figure 4 for SocialMaze: A Benchmark for Evaluating Social Reasoning in Large Language Models
Viaarxiv icon

Cross-Lingual Pitfalls: Automatic Probing Cross-Lingual Weakness of Multilingual Large Language Models

Add code
May 24, 2025
Viaarxiv icon

The Hallucination Tax of Reinforcement Finetuning

Add code
May 20, 2025
Viaarxiv icon

BIASINSPECTOR: Detecting Bias in Structured Data through LLM Agents

Add code
Apr 07, 2025
Viaarxiv icon

Efficient Reinforcement Finetuning via Adaptive Curriculum Learning

Add code
Apr 07, 2025
Viaarxiv icon