Picture for Wenhao Yu

Wenhao Yu

China University of Geosciences

Describe-then-Reason: Improving Multimodal Mathematical Reasoning through Visual Comprehension Training

Add code
Apr 26, 2024
Viaarxiv icon

A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene

Add code
Apr 17, 2024
Figure 1 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 2 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 3 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Figure 4 for A Progressive Framework of Vision-language Knowledge Distillation and Alignment for Multilingual Scene
Viaarxiv icon

LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators

Add code
Mar 27, 2024
Figure 1 for LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators
Figure 2 for LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators
Figure 3 for LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators
Figure 4 for LocoMan: Advancing Versatile Quadrupedal Dexterity with Lightweight Loco-Manipulators
Viaarxiv icon

CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments

Add code
Mar 22, 2024
Figure 1 for CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
Figure 2 for CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
Figure 3 for CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
Figure 4 for CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments
Viaarxiv icon

Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction

Add code
Feb 29, 2024
Figure 1 for Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction
Figure 2 for Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction
Figure 3 for Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction
Figure 4 for Towards Safe and Reliable Autonomous Driving: Dynamic Occupancy Set Prediction
Viaarxiv icon

StarCoder 2 and The Stack v2: The Next Generation

Add code
Feb 29, 2024
Figure 1 for StarCoder 2 and The Stack v2: The Next Generation
Figure 2 for StarCoder 2 and The Stack v2: The Next Generation
Figure 3 for StarCoder 2 and The Stack v2: The Next Generation
Figure 4 for StarCoder 2 and The Stack v2: The Next Generation
Viaarxiv icon

Learning to Learn Faster from Human Feedback with Language Model Predictive Control

Add code
Feb 18, 2024
Figure 1 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 2 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 3 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Figure 4 for Learning to Learn Faster from Human Feedback with Language Model Predictive Control
Viaarxiv icon

PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs

Add code
Feb 12, 2024
Figure 1 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 2 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 3 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Figure 4 for PIVOT: Iterative Visual Prompting Elicits Actionable Knowledge for VLMs
Viaarxiv icon

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Add code
Jan 28, 2024
Figure 1 for WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Figure 2 for WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Figure 3 for WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Figure 4 for WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
Viaarxiv icon

Gradient Shaping for Multi-Constraint Safe Reinforcement Learning

Add code
Dec 23, 2023
Figure 1 for Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Figure 2 for Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Figure 3 for Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Figure 4 for Gradient Shaping for Multi-Constraint Safe Reinforcement Learning
Viaarxiv icon