Picture for Simon Shaolei Du

Simon Shaolei Du

Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning

Add code
Jul 02, 2024
Figure 1 for Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Figure 2 for Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Figure 3 for Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Figure 4 for Cost-Effective Proxy Reward Model Construction with On-Policy and Active Learning
Viaarxiv icon

CLIPLoss and Norm-Based Data Selection Methods for Multimodal Contrastive Learning

Add code
May 29, 2024
Viaarxiv icon

Offline Multi-task Transfer RL with Representational Penalization

Add code
Feb 19, 2024
Viaarxiv icon

Variance Alignment Score: A Simple But Tough-to-Beat Data Selection Method for Multimodal Contrastive Learning

Add code
Feb 03, 2024
Viaarxiv icon

Free from Bellman Completeness: Trajectory Stitching via Model-based Return-conditioned Supervised Learning

Add code
Oct 30, 2023
Viaarxiv icon

Robust Offline Reinforcement Learning -- Certify the Confidence Interval

Add code
Oct 03, 2023
Figure 1 for Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Figure 2 for Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Figure 3 for Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Figure 4 for Robust Offline Reinforcement Learning -- Certify the Confidence Interval
Viaarxiv icon

LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning

Add code
Jun 16, 2023
Figure 1 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 2 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 3 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Figure 4 for LabelBench: A Comprehensive Framework for Benchmarking Label-Efficient Learning
Viaarxiv icon

A Benchmark for Low-Switching-Cost Reinforcement Learning

Add code
Dec 13, 2021
Figure 1 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 2 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 3 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Figure 4 for A Benchmark for Low-Switching-Cost Reinforcement Learning
Viaarxiv icon

Hypothesis Transfer Learning via Transformation Functions

Add code
Nov 05, 2017
Figure 1 for Hypothesis Transfer Learning via Transformation Functions
Figure 2 for Hypothesis Transfer Learning via Transformation Functions
Figure 3 for Hypothesis Transfer Learning via Transformation Functions
Figure 4 for Hypothesis Transfer Learning via Transformation Functions
Viaarxiv icon