Xiyao Wang

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Feb 13, 2024
Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Shuang Ma, Hal Daumé III, Huazhe Xu, John Langford, Praveen Palanisamy, Kalyan Shankar Basu, Furong Huang

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Xiyao Wang, Yuhang Zhou, Xiaoyu Liu, Hongjin Lu, Yuancheng Xu, Feihong He, Jaehong Yoon, Taixi Lu, Gedas Bertasius, Mohit Bansal, Huaxiu Yao, Furong Huang

Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications

Jan 22, 2024
Yuhang Zhou, Paiheng Xu, Xiyao Wang, Xuan Lu, Ge Gao, Wei Ai

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

Oct 30, 2023
Guowei Xu, Ruijie Zheng, Yongyuan Liang, Xiyao Wang, Zhecheng Yuan, Tianying Ji, Yu Luo, Xiaoyu Liu, Jiaxin Yuan, Pu Hua, Shuzhen Li, Yanjie Ze, Hal Daumé III, Furong Huang, Huazhe Xu

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Oct 11, 2023
Xiyao Wang, Ruijie Zheng, Yanchao Sun, Ruonan Jia, Wichayaporn Wongkamjan, Huazhe Xu, Furong Huang

Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

Sep 07, 2023
Yuancheng Xu, Chenghao Deng, Yanchao Sun, Ruijie Zheng, Xiyao Wang, Jieyu Zhao, Furong Huang

TACO: Temporal Latent Action-Driven Contrastive Loss for Visual Reinforcement Learning

Jun 22, 2023
Ruijie Zheng, Xiyao Wang, Yanchao Sun, Shuang Ma, Jieyu Zhao, Huazhe Xu, Hal Daumé III, Furong Huang

Is Model Ensemble Necessary? Model-based RL via a Single Model with Lipschitz Regularized Value Function

Feb 02, 2023
Ruijie Zheng, Xiyao Wang, Huazhe Xu, Furong Huang

Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy

Jul 25, 2022
Xiyao Wang, Wichayaporn Wongkamjan, Furong Huang

Transfer RL across Observation Feature Spaces via Model-Based Regularization

Jan 01, 2022
Yanchao Sun, Ruijie Zheng, Xiyao Wang, Andrew Cohen, Furong Huang
