Picture for Jun Zhu

Jun Zhu

Tsinghua University

Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling

Add code
Sep 29, 2022
Figure 1 for Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Figure 2 for Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Figure 3 for Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Figure 4 for Offline Reinforcement Learning via High-Fidelity Generative Behavior Modeling
Viaarxiv icon

All are Worth Words: a ViT Backbone for Score-based Diffusion Models

Add code
Sep 25, 2022
Figure 1 for All are Worth Words: a ViT Backbone for Score-based Diffusion Models
Figure 2 for All are Worth Words: a ViT Backbone for Score-based Diffusion Models
Figure 3 for All are Worth Words: a ViT Backbone for Score-based Diffusion Models
Figure 4 for All are Worth Words: a ViT Backbone for Score-based Diffusion Models
Viaarxiv icon

Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients

Add code
Sep 15, 2022
Figure 1 for Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients
Figure 2 for Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients
Figure 3 for Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients
Figure 4 for Bi-level Physics-Informed Neural Networks for PDE Constrained Optimization using Broyden's Hypergradients
Viaarxiv icon

On the Reuse Bias in Off-Policy Reinforcement Learning

Add code
Sep 15, 2022
Figure 1 for On the Reuse Bias in Off-Policy Reinforcement Learning
Figure 2 for On the Reuse Bias in Off-Policy Reinforcement Learning
Figure 3 for On the Reuse Bias in Off-Policy Reinforcement Learning
Figure 4 for On the Reuse Bias in Off-Policy Reinforcement Learning
Viaarxiv icon

Regret Analysis for Hierarchical Experts Bandit Problem

Add code
Aug 11, 2022
Figure 1 for Regret Analysis for Hierarchical Experts Bandit Problem
Figure 2 for Regret Analysis for Hierarchical Experts Bandit Problem
Figure 3 for Regret Analysis for Hierarchical Experts Bandit Problem
Figure 4 for Regret Analysis for Hierarchical Experts Bandit Problem
Viaarxiv icon

Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data

Add code
Aug 03, 2022
Figure 1 for Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data
Figure 2 for Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data
Figure 3 for Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data
Figure 4 for Robust Learning of Deep Time Series Anomaly Detection Models with Contaminated Training Data
Viaarxiv icon

EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations

Add code
Jul 14, 2022
Figure 1 for EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Figure 2 for EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Figure 3 for EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Figure 4 for EGSDE: Unpaired Image-to-Image Translation via Energy-Guided Stochastic Differential Equations
Viaarxiv icon

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Add code
Jul 13, 2022
Figure 1 for CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One
Figure 2 for CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One
Figure 3 for CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One
Figure 4 for CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One
Viaarxiv icon

DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization

Add code
Jul 12, 2022
Figure 1 for DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Figure 2 for DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Figure 3 for DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Figure 4 for DGPO: Discovering Multiple Strategies with Diversity-Guided Policy Optimization
Viaarxiv icon

Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching

Add code
Jun 27, 2022
Figure 1 for Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching
Figure 2 for Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching
Figure 3 for Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching
Figure 4 for Maximum Likelihood Training for Score-Based Diffusion ODEs by High-Order Denoising Score Matching
Viaarxiv icon