Alert button
Picture for Mingfei Sun

Mingfei Sun

Alert button

The University of Manchester

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

Mar 14, 2024
Maytus Piriyajitakonkij, Mingfei Sun, Mengmi Zhang, Wei Pan

Figure 1 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 2 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 3 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 4 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Viaarxiv icon

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation

Mar 10, 2024
Hanfang Lyu, Yuanchen Bai, Xin Liang, Ujaan Das, Chuhan Shi, Leiliang Gong, Yingchi Li, Mingfei Sun, Ming Ge, Xiaojuan Ma

Viaarxiv icon

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

Jun 23, 2023
Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E. Turner

Figure 1 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 2 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 3 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 4 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Feb 15, 2023
Mingfei Sun, Benjamin Ellis, Anuj Mahajan, Sam Devlin, Katja Hofmann, Shimon Whiteson

Figure 1 for Trust-Region-Free Policy Optimization for Stochastic Policies
Figure 2 for Trust-Region-Free Policy Optimization for Stochastic Policies
Viaarxiv icon

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Feb 05, 2023
Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Figure 1 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 2 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 3 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 4 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Jan 25, 2023
Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

Figure 1 for Imitating Human Behaviour with Diffusion Models
Figure 2 for Imitating Human Behaviour with Diffusion Models
Figure 3 for Imitating Human Behaviour with Diffusion Models
Figure 4 for Imitating Human Behaviour with Diffusion Models
Viaarxiv icon

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Jan 20, 2023
Haoxuan Pan, Deheng Ye, Xiaoming Duan, Qiang Fu, Wei Yang, Jianping He, Mingfei Sun

Figure 1 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 2 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 3 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 4 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Dec 14, 2022
Benjamin Ellis, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson

Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

UniMASK: Unified Inference in Sequential Decision Problems

Nov 20, 2022
Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Figure 1 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 2 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 3 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 4 for UniMASK: Unified Inference in Sequential Decision Problems
Viaarxiv icon