Yuexiang Zhai

Fine-Tuning Large Vision-Language Models as Decision-Making Agents via Reinforcement Learning

May 17, 2024

Is Offline Decision Making Possible with Only Few Samples? Reliable Decisions in Data-Starved Bandits via Trust Region Enhancement

Feb 24, 2024

Eyes Wide Shut? Exploring the Visual Shortcomings of Multimodal LLMs

Jan 11, 2024

LMRL Gym: Benchmarks for Multi-Turn Reinforcement Learning with Language Models

Nov 30, 2023

White-Box Transformers via Sparse Rate Reduction: Compression Is All There Is?

Nov 24, 2023

RLIF: Interactive Imitation Learning as Reinforcement Learning

Nov 21, 2023

Investigating the Catastrophic Forgetting in Multimodal Large Language Models

Sep 26, 2023

Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning

Mar 09, 2023

Closed-Loop Transcription via Convolutional Sparse Coding

Feb 18, 2023

Understanding the Complexity Gains of Single-Task RL with a Curriculum

Dec 24, 2022