Picture for Chenyang Zhao

Chenyang Zhao

Balancing Multiple Objectives in Urban Traffic Control with Reinforcement Learning from AI Feedback

Add code
Feb 24, 2026
Viaarxiv icon

CM2: Reinforcement Learning with Checklist Rewards for Multi-Turn and Multi-Step Agentic Tool Use

Add code
Feb 12, 2026
Viaarxiv icon

MMFormalizer: Multimodal Autoformalization in the Wild

Add code
Jan 06, 2026
Viaarxiv icon

ScienceDB AI: An LLM-Driven Agentic Recommender System for Large-Scale Scientific Data Sharing Services

Add code
Jan 03, 2026
Viaarxiv icon

RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments

Add code
Nov 10, 2025
Figure 1 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 2 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 3 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Figure 4 for RLVE: Scaling Up Reinforcement Learning for Language Models with Adaptive Verifiable Environments
Viaarxiv icon

Object-IR: Leveraging Object Consistency and Mesh Deformation for Self-Supervised Image Retargeting

Add code
Oct 31, 2025
Viaarxiv icon

A1: Asynchronous Test-Time Scaling via Conformal Prediction

Add code
Sep 18, 2025
Figure 1 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 2 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 3 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Figure 4 for A1: Asynchronous Test-Time Scaling via Conformal Prediction
Viaarxiv icon

LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction

Add code
Sep 09, 2025
Figure 1 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 2 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 3 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Figure 4 for LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
Viaarxiv icon

SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving

Add code
May 29, 2025
Figure 1 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 2 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 3 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Figure 4 for SwingArena: Competitive Programming Arena for Long-context GitHub Issue Solving
Viaarxiv icon

Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting

Add code
May 28, 2025
Figure 1 for Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting
Figure 2 for Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting
Figure 3 for Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting
Figure 4 for Point-to-Region Loss for Semi-Supervised Point-Based Crowd Counting
Viaarxiv icon