Picture for Yasi Zhang

Yasi Zhang

REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge

Add code
Mar 17, 2026
Viaarxiv icon

Learning Regularization Functionals for Inverse Problems: A Comparative Study

Add code
Oct 02, 2025
Viaarxiv icon

Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation

Add code
May 19, 2025
Figure 1 for Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation
Figure 2 for Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation
Figure 3 for Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation
Figure 4 for Restoration Score Distillation: From Corrupted Diffusion Pretraining to One-Step High-Quality Generation
Viaarxiv icon

Denoising Score Distillation: From Noisy Diffusion Pretraining to One-Step High-Quality Generation

Add code
Mar 10, 2025
Viaarxiv icon

Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory

Add code
Nov 25, 2024
Figure 1 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 2 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 3 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Figure 4 for Unlocking the Potential of Text-to-Image Diffusion with PAC-Bayesian Theory
Viaarxiv icon

Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory

Add code
Nov 01, 2024
Figure 1 for Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory
Figure 2 for Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory
Figure 3 for Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory
Figure 4 for Statistical Guarantees for Lifelong Reinforcement Learning using PAC-Bayesian Theory
Viaarxiv icon

DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting

Add code
Oct 15, 2024
Figure 1 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 2 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 3 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Figure 4 for DODT: Enhanced Online Decision Transformer Learning through Dreamer's Actor-Critic Trajectory Forecasting
Viaarxiv icon

Think Twice Before You Act: Improving Inverse Problem Solving With MCMC

Add code
Sep 13, 2024
Viaarxiv icon

Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching

Add code
May 29, 2024
Figure 1 for Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Figure 2 for Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Figure 3 for Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Figure 4 for Flow Priors for Linear Inverse Problems via Iterative Corrupted Trajectory Matching
Viaarxiv icon

Latent Energy-Based Odyssey: Black-Box Optimization via Expanded Exploration in the Energy-Based Latent Space

Add code
May 27, 2024
Viaarxiv icon