Picture for Hao Shao

Hao Shao

MoVA: Adapting Mixture of Vision Experts to Multimodal Context

Add code
Apr 19, 2024
Viaarxiv icon

Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models

Add code
Mar 25, 2024
Figure 1 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 2 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 3 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Figure 4 for Visual CoT: Unleashing Chain-of-Thought Reasoning in Multi-Modal Language Models
Viaarxiv icon

SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction

Add code
Mar 19, 2024
Figure 1 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 2 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 3 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Figure 4 for SmartRefine: A Scenario-Adaptive Refinement Framework for Efficient Motion Prediction
Viaarxiv icon

SPHINX-X: Scaling Data and Parameters for a Family of Multi-modal Large Language Models

Add code
Feb 08, 2024
Viaarxiv icon

LMDrive: Closed-Loop End-to-End Driving with Large Language Models

Add code
Dec 21, 2023
Viaarxiv icon

MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention

Add code
Dec 20, 2023
Figure 1 for MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention
Figure 2 for MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention
Figure 3 for MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention
Figure 4 for MCANet: Medical Image Segmentation with Multi-Scale Cross-Axis Attention
Viaarxiv icon

Polyper: Boundary Sensitive Polyp Segmentation

Add code
Dec 14, 2023
Viaarxiv icon

ReasonNet: End-to-End Driving with Temporal and Global Reasoning

Add code
May 17, 2023
Figure 1 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 2 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 3 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Figure 4 for ReasonNet: End-to-End Driving with Temporal and Global Reasoning
Viaarxiv icon

Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors

Add code
May 08, 2023
Figure 1 for Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Figure 2 for Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Figure 3 for Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Figure 4 for Efficient Reinforcement Learning for Autonomous Driving with Parameterized Skills and Priors
Viaarxiv icon

Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer

Add code
Jul 29, 2022
Figure 1 for Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Figure 2 for Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Figure 3 for Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Figure 4 for Safety-Enhanced Autonomous Driving Using Interpretable Sensor Fusion Transformer
Viaarxiv icon