Picture for Shuang Qiu

Shuang Qiu

University of Michigan, Ann Arbor

Pessimism Meets Risk: Risk-Sensitive Offline Reinforcement Learning

Add code
Jul 10, 2024
Viaarxiv icon

Human-like object concept representations emerge naturally in multimodal large language models

Add code
Jul 01, 2024
Figure 1 for Human-like object concept representations emerge naturally in multimodal large language models
Figure 2 for Human-like object concept representations emerge naturally in multimodal large language models
Figure 3 for Human-like object concept representations emerge naturally in multimodal large language models
Figure 4 for Human-like object concept representations emerge naturally in multimodal large language models
Viaarxiv icon

Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards

Add code
Mar 06, 2024
Figure 1 for Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Figure 2 for Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Figure 3 for Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Figure 4 for Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards
Viaarxiv icon

Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment

Add code
Feb 25, 2024
Figure 1 for Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Figure 2 for Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Figure 3 for Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Figure 4 for Rewards-in-Context: Multi-objective Alignment of Foundation Models with Dynamic Preference Adjustment
Viaarxiv icon

A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding

Add code
Jan 12, 2024
Figure 1 for A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding
Figure 2 for A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding
Figure 3 for A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding
Figure 4 for A Temporal-Spectral Fusion Transformer with Subject-specific Adapter for Enhancing RSVP-BCI Decoding
Viaarxiv icon

Posterior Sampling for Competitive RL: Function Approximation and Partial Observation

Add code
Oct 30, 2023
Viaarxiv icon

StairNetV3: Depth-aware Stair Modeling using Deep Learning

Add code
Aug 13, 2023
Figure 1 for StairNetV3: Depth-aware Stair Modeling using Deep Learning
Figure 2 for StairNetV3: Depth-aware Stair Modeling using Deep Learning
Figure 3 for StairNetV3: Depth-aware Stair Modeling using Deep Learning
Figure 4 for StairNetV3: Depth-aware Stair Modeling using Deep Learning
Viaarxiv icon

On the Value of Myopic Behavior in Policy Reuse

Add code
May 28, 2023
Figure 1 for On the Value of Myopic Behavior in Policy Reuse
Figure 2 for On the Value of Myopic Behavior in Policy Reuse
Figure 3 for On the Value of Myopic Behavior in Policy Reuse
Figure 4 for On the Value of Myopic Behavior in Policy Reuse
Viaarxiv icon

RGB-D-based Stair Detection using Deep Learning for Autonomous Stair Climbing

Add code
Dec 09, 2022
Figure 1 for RGB-D-based Stair Detection using Deep Learning for Autonomous Stair Climbing
Figure 2 for RGB-D-based Stair Detection using Deep Learning for Autonomous Stair Climbing
Figure 3 for RGB-D-based Stair Detection using Deep Learning for Autonomous Stair Climbing
Figure 4 for RGB-D-based Stair Detection using Deep Learning for Autonomous Stair Climbing
Viaarxiv icon

Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning

Add code
Jul 29, 2022
Figure 1 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 2 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 3 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Figure 4 for Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning
Viaarxiv icon