Picture for Mingyang Yi

Mingyang Yi

Renmin University of China

ETS: Energy-Guided Test-Time Scaling for Training-Free RL Alignment

Add code
Jan 29, 2026
Viaarxiv icon

Towards a Theoretical Understanding to the Generalization of RLHF

Add code
Jan 23, 2026
Viaarxiv icon

TR-PTS: Task-Relevant Parameter and Token Selection for Efficient Tuning

Add code
Jul 30, 2025
Viaarxiv icon

Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards

Add code
May 07, 2025
Figure 1 for Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards
Figure 2 for Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards
Figure 3 for Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards
Figure 4 for Reward-SQL: Boosting Text-to-SQL via Stepwise Reasoning and Process-Supervised Rewards
Viaarxiv icon

Improved Diffusion-based Generative Model with Better Adversarial Robustness

Add code
Feb 24, 2025
Viaarxiv icon

Reveal the Mystery of DPO: The Connection between DPO and RL Algorithms

Add code
Feb 05, 2025
Viaarxiv icon

Towards Understanding the Working Mechanism of Text-to-Image Diffusion Model

Add code
May 24, 2024
Viaarxiv icon

Enhancing Text-to-Image Editing via Hybrid Mask-Informed Fusion

Add code
May 24, 2024
Viaarxiv icon

Continuous-time Riemannian SGD and SVRG Flows on Wasserstein Probabilistic Space

Add code
Jan 25, 2024
Viaarxiv icon

SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models

Add code
Sep 10, 2023
Figure 1 for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Figure 2 for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Figure 3 for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Figure 4 for SA-Solver: Stochastic Adams Solver for Fast Sampling of Diffusion Models
Viaarxiv icon