Alert button
Picture for Mingfei Sun

Mingfei Sun

Alert button

The University of Manchester

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

Add code
Bookmark button
Alert button
Mar 14, 2024
Maytus Piriyajitakonkij, Mingfei Sun, Mengmi Zhang, Wei Pan

Figure 1 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 2 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 3 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 4 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Viaarxiv icon

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation

Add code
Bookmark button
Alert button
Mar 10, 2024
Hanfang Lyu, Yuanchen Bai, Xin Liang, Ujaan Das, Chuhan Shi, Leiliang Gong, Yingchi Li, Mingfei Sun, Ming Ge, Xiaojuan Ma

Figure 1 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 2 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 3 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 4 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Viaarxiv icon

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

Add code
Bookmark button
Alert button
Jun 23, 2023
Massimiliano Patacchiola, Mingfei Sun, Katja Hofmann, Richard E. Turner

Figure 1 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 2 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 3 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 4 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Add code
Bookmark button
Alert button
Feb 15, 2023
Mingfei Sun, Benjamin Ellis, Anuj Mahajan, Sam Devlin, Katja Hofmann, Shimon Whiteson

Figure 1 for Trust-Region-Free Policy Optimization for Stochastic Policies
Figure 2 for Trust-Region-Free Policy Optimization for Stochastic Policies
Viaarxiv icon

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Add code
Bookmark button
Alert button
Feb 05, 2023
Zichuan Lin, Xiapeng Wu, Mingfei Sun, Deheng Ye, Qiang Fu, Wei Yang, Wei Liu

Figure 1 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 2 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 3 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 4 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Add code
Bookmark button
Alert button
Jan 25, 2023
Tim Pearce, Tabish Rashid, Anssi Kanervisto, Dave Bignell, Mingfei Sun, Raluca Georgescu, Sergio Valcarcel Macua, Shan Zheng Tan, Ida Momennejad, Katja Hofmann, Sam Devlin

Figure 1 for Imitating Human Behaviour with Diffusion Models
Figure 2 for Imitating Human Behaviour with Diffusion Models
Figure 3 for Imitating Human Behaviour with Diffusion Models
Figure 4 for Imitating Human Behaviour with Diffusion Models
Viaarxiv icon

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Add code
Bookmark button
Alert button
Jan 20, 2023
Haoxuan Pan, Deheng Ye, Xiaoming Duan, Qiang Fu, Wei Yang, Jianping He, Mingfei Sun

Figure 1 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 2 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 3 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 4 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Bookmark button
Alert button
Dec 14, 2022
Benjamin Ellis, Skander Moalla, Mikayel Samvelyan, Mingfei Sun, Anuj Mahajan, Jakob N. Foerster, Shimon Whiteson

Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

UniMASK: Unified Inference in Sequential Decision Problems

Add code
Bookmark button
Alert button
Nov 20, 2022
Micah Carroll, Orr Paradise, Jessy Lin, Raluca Georgescu, Mingfei Sun, David Bignell, Stephanie Milani, Katja Hofmann, Matthew Hausknecht, Anca Dragan, Sam Devlin

Figure 1 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 2 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 3 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 4 for UniMASK: Unified Inference in Sequential Decision Problems
Viaarxiv icon