Picture for Mingfei Sun

Mingfei Sun

The University of Manchester

Effective Generation of Feasible Solutions for Integer Programming via Guided Diffusion

Add code
Jun 18, 2024
Viaarxiv icon

TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions

Add code
Mar 14, 2024
Figure 1 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 2 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 3 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Figure 4 for TTA-Nav: Test-time Adaptive Reconstruction for Point-Goal Navigation under Visual Corruptions
Viaarxiv icon

FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation

Add code
Mar 10, 2024
Figure 1 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 2 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 3 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Figure 4 for FARPLS: A Feature-Augmented Robot Trajectory Preference Labeling System to Assist Human Labelers' Preference Elicitation
Viaarxiv icon

Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation

Add code
Jun 23, 2023
Figure 1 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 2 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 3 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Figure 4 for Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation
Viaarxiv icon

Trust-Region-Free Policy Optimization for Stochastic Policies

Add code
Feb 15, 2023
Figure 1 for Trust-Region-Free Policy Optimization for Stochastic Policies
Figure 2 for Trust-Region-Free Policy Optimization for Stochastic Policies
Viaarxiv icon

Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization

Add code
Feb 05, 2023
Figure 1 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 2 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 3 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Figure 4 for Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization
Viaarxiv icon

Imitating Human Behaviour with Diffusion Models

Add code
Jan 25, 2023
Figure 1 for Imitating Human Behaviour with Diffusion Models
Figure 2 for Imitating Human Behaviour with Diffusion Models
Figure 3 for Imitating Human Behaviour with Diffusion Models
Figure 4 for Imitating Human Behaviour with Diffusion Models
Viaarxiv icon

Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning

Add code
Jan 20, 2023
Figure 1 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 2 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 3 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Figure 4 for Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning
Viaarxiv icon

SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning

Add code
Dec 14, 2022
Figure 1 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 2 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 3 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Figure 4 for SMACv2: An Improved Benchmark for Cooperative Multi-Agent Reinforcement Learning
Viaarxiv icon

UniMASK: Unified Inference in Sequential Decision Problems

Add code
Nov 20, 2022
Figure 1 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 2 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 3 for UniMASK: Unified Inference in Sequential Decision Problems
Figure 4 for UniMASK: Unified Inference in Sequential Decision Problems
Viaarxiv icon