Yihao Feng

UniControl: A Unified Diffusion Model for Controllable Visual Generation In the Wild

May 18, 2023
Can Qin, Shu Zhang, Ning Yu, Yihao Feng, Xinyi Yang, Yingbo Zhou, Huan Wang, Juan Carlos Niebles, Caiming Xiong, Silvio Savarese, Stefano Ermon, Yun Fu, Ran Xu

HIVE: Harnessing Human Feedback for Instructional Visual Editing

Mar 16, 2023
Shu Zhang, Xinyi Yang, Yihao Feng, Can Qin, Chia-Chih Chen, Ning Yu, Zeyuan Chen, Huan Wang, Silvio Savarese, Stefano Ermon, Caiming Xiong, Ran Xu

Fantastic Rewards and How to Tame Them: A Case Study on Reward Learning for Task-oriented Dialogue Systems

Feb 20, 2023
Yihao Feng, Shentao Yang, Shujian Zhang, Jianguo Zhang, Caiming Xiong, Mingyuan Zhou, Huan Wang

A Unified Framework for Alternating Offline Model Training and Policy Learning

Oct 12, 2022
Shentao Yang, Shujian Zhang, Yihao Feng, Mingyuan Zhou

Metric Residual Networks for Sample Efficient Goal-conditioned Reinforcement Learning

Aug 17, 2022
Bo Liu, Yihao Feng, Qiang Liu, Peter Stone

Regularizing a Model-based Policy Stationary Distribution to Stabilize Offline Reinforcement Learning

Jun 14, 2022
Shentao Yang, Yihao Feng, Shujian Zhang, Mingyuan Zhou

A Regularized Implicit Policy for Offline Reinforcement Learning

Feb 19, 2022
Shentao Yang, Zhendong Wang, Huangjie Zheng, Yihao Feng, Mingyuan Zhou

Operator Deep Q-Learning: Zero-Shot Reward Transferring in Reinforcement Learning

Jan 01, 2022
Ziyang Tang, Yihao Feng, Qiang Liu

Unsupervised Out-of-Domain Detection via Pre-trained Transformers

Jun 02, 2021
Keyang Xu, Tongzheng Ren, Shikun Zhang, Yihao Feng, Caiming Xiong

Incremental Few-shot Text Classification with Multi-round New Classes: Formulation, Dataset and System

Apr 24, 2021
Congying Xia, Wenpeng Yin, Yihao Feng, Philip Yu
