Picture for Ray Zhang

Ray Zhang

MR-MLLM: Mutual Reinforcement of Multimodal Comprehension and Vision Perception

Add code
Jun 22, 2024
Viaarxiv icon

RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking

Add code
Mar 02, 2024
Figure 1 for RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking
Figure 2 for RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking
Figure 3 for RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking
Figure 4 for RKHS-BA: A Semantic Correspondence-Free Multi-View Registration Framework with Global Tracking
Viaarxiv icon

Cloud-Device Collaborative Learning for Multimodal Large Language Models

Add code
Dec 26, 2023
Figure 1 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 2 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 3 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Figure 4 for Cloud-Device Collaborative Learning for Multimodal Large Language Models
Viaarxiv icon

BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery

Add code
Dec 25, 2023
Figure 1 for BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery
Figure 2 for BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery
Figure 3 for BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery
Figure 4 for BDIS-SLAM: A lightweight CPU-based dense stereo SLAM for surgery
Viaarxiv icon

FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection

Add code
Dec 22, 2023
Figure 1 for FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Figure 2 for FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Figure 3 for FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Figure 4 for FM-OV3D: Foundation Model-based Cross-modal Knowledge Blending for Open-Vocabulary 3D Detection
Viaarxiv icon

LiDAR-LLM: Exploring the Potential of Large Language Models for 3D LiDAR Understanding

Add code
Dec 21, 2023
Viaarxiv icon

Stain-free Detection of Embryo Polarization using Deep Learning

Add code
Nov 08, 2021
Figure 1 for Stain-free Detection of Embryo Polarization using Deep Learning
Figure 2 for Stain-free Detection of Embryo Polarization using Deep Learning
Figure 3 for Stain-free Detection of Embryo Polarization using Deep Learning
Figure 4 for Stain-free Detection of Embryo Polarization using Deep Learning
Viaarxiv icon

Deep Multi-Modal Contact Estimation for Invariant Observer Design on Quadruped Robots

Add code
Jul 07, 2021
Figure 1 for Deep Multi-Modal Contact Estimation for Invariant Observer Design on Quadruped Robots
Figure 2 for Deep Multi-Modal Contact Estimation for Invariant Observer Design on Quadruped Robots
Figure 3 for Deep Multi-Modal Contact Estimation for Invariant Observer Design on Quadruped Robots
Figure 4 for Deep Multi-Modal Contact Estimation for Invariant Observer Design on Quadruped Robots
Viaarxiv icon

Bayesian Spatial Kernel Smoothing for ScalableDense Semantic Mapping

Add code
Sep 10, 2019
Figure 1 for Bayesian Spatial Kernel Smoothing for ScalableDense Semantic Mapping
Figure 2 for Bayesian Spatial Kernel Smoothing for ScalableDense Semantic Mapping
Figure 3 for Bayesian Spatial Kernel Smoothing for ScalableDense Semantic Mapping
Figure 4 for Bayesian Spatial Kernel Smoothing for ScalableDense Semantic Mapping
Viaarxiv icon