Picture for Donghao Ying

Donghao Ying

Max

AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and Beyond

Add code
Sep 30, 2025
Viaarxiv icon

Towards VM Rescheduling Optimization Through Deep Reinforcement Learning

Add code
May 23, 2025
Viaarxiv icon

Few-Shot Test-Time Optimization Without Retraining for Semiconductor Recipe Generation and Beyond

Add code
May 21, 2025
Viaarxiv icon

Reward-Safety Balance in Offline Safe RL via Diffusion Regularization

Add code
Feb 18, 2025
Figure 1 for Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Figure 2 for Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Figure 3 for Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Figure 4 for Reward-Safety Balance in Offline Safe RL via Diffusion Regularization
Viaarxiv icon

Bagging Improves Generalization Exponentially

Add code
May 23, 2024
Figure 1 for Bagging Improves Generalization Exponentially
Figure 2 for Bagging Improves Generalization Exponentially
Figure 3 for Bagging Improves Generalization Exponentially
Figure 4 for Bagging Improves Generalization Exponentially
Viaarxiv icon

Scalable Primal-Dual Actor-Critic Method for Safe Multi-Agent RL with General Utilities

Add code
May 27, 2023
Viaarxiv icon

Scalable Multi-Agent Reinforcement Learning with General Utilities

Add code
Feb 15, 2023
Viaarxiv icon

Policy-based Primal-Dual Methods for Convex Constrained Markov Decision Processes

Add code
May 22, 2022
Figure 1 for Policy-based Primal-Dual Methods for Convex Constrained Markov Decision Processes
Viaarxiv icon

A Dual Approach to Constrained Markov Decision Processes with Entropy Regularization

Add code
Oct 17, 2021
Viaarxiv icon