Xiyao Wang

Multi-Stage Balanced Distillation: Addressing Long-Tail Challenges in Sequence-Level Knowledge Distillation

Jun 19, 2024
Via arXiv

World Models with Hints of Large Language Models for Goal Achieving

Jun 11, 2024
Via arXiv

Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement

May 29, 2024
Via arXiv

Calibrated Self-Rewarding Vision Language Models

May 23, 2024
Via arXiv

Premier-TACO is a Few-Shot Policy Learner: Pretraining Multitask Representation via Temporal Action-Driven Contrastive Loss

Feb 13, 2024
Via arXiv

Mementos: A Comprehensive Benchmark for Multimodal Large Language Model Reasoning over Image Sequences

Jan 25, 2024
Via arXiv

Emojis Decoded: Leveraging ChatGPT for Enhanced Understanding in Social Media Communications

Jan 22, 2024
Via arXiv

DrM: Mastering Visual Reinforcement Learning through Dormant Ratio Minimization

Oct 30, 2023
Via arXiv

COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL

Oct 11, 2023
Via arXiv

Equal Long-term Benefit Rate: Adapting Static Fairness Notions to Sequential Decision Making

Sep 07, 2023
Via arXiv