Picture for Zichen Zhang

Zichen Zhang

SoupLM: Model Integration in Large Language and Multi-Modal Models

Add code
Jul 11, 2024
Viaarxiv icon

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Add code
Jun 28, 2024
Figure 1 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 2 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 3 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 4 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Viaarxiv icon

Bidirectional Progressive Transformer for Interaction Intention Anticipation

Add code
May 09, 2024
Viaarxiv icon

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Add code
Dec 28, 2023
Viaarxiv icon

Universal Visual Decomposer: Long-Horizon Manipulation Made Easy

Add code
Oct 12, 2023
Figure 1 for Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Figure 2 for Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Figure 3 for Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Figure 4 for Universal Visual Decomposer: Long-Horizon Manipulation Made Easy
Viaarxiv icon

When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning

Add code
Mar 30, 2023
Figure 1 for When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Figure 2 for When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Figure 3 for When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Figure 4 for When Learning Is Out of Reach, Reset: Generalization in Autonomous Visuomotor Reinforcement Learning
Viaarxiv icon

Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning

Add code
Dec 24, 2022
Figure 1 for Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning
Figure 2 for Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning
Figure 3 for Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning
Figure 4 for Streaming Traffic Flow Prediction Based on Continuous Reinforcement Learning
Viaarxiv icon

Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off

Add code
Dec 17, 2022
Figure 1 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 2 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 3 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Figure 4 for Managing Temporal Resolution in Continuous Value Estimation: A Fundamental Trade-off
Viaarxiv icon

A Simple Decentralized Cross-Entropy Method

Add code
Dec 16, 2022
Figure 1 for A Simple Decentralized Cross-Entropy Method
Figure 2 for A Simple Decentralized Cross-Entropy Method
Figure 3 for A Simple Decentralized Cross-Entropy Method
Figure 4 for A Simple Decentralized Cross-Entropy Method
Viaarxiv icon

VIMA: General Robot Manipulation with Multimodal Prompts

Add code
Oct 06, 2022
Figure 1 for VIMA: General Robot Manipulation with Multimodal Prompts
Figure 2 for VIMA: General Robot Manipulation with Multimodal Prompts
Figure 3 for VIMA: General Robot Manipulation with Multimodal Prompts
Figure 4 for VIMA: General Robot Manipulation with Multimodal Prompts
Viaarxiv icon