RGB Stacking


Real-Time 2D LiDAR Object Detection Using Three-Frame RGB Scan Encoding

Feb 02, 2026
Via arXiv

SMc2f: Robust Scenario Mining for Robotic Autonomy from Coarse to Fine

Jan 17, 2026
Via arXiv

Diffusion Knows Transparency: Repurposing Video Diffusion for Transparent Object Depth and Normal Estimation

Dec 29, 2025
Via arXiv

Multi-scale Attention-Guided Intrinsic Decomposition and Rendering Pass Prediction for Facial Images

Dec 18, 2025
Via arXiv

Multimodal RGB-HSI Feature Fusion with Patient-Aware Incremental Heuristic Meta-Learning for Oral Lesion Classification

Nov 15, 2025
Via arXiv

Real-Time Obstacle Avoidance for a Mobile Robot Using CNN-Based Sensor Fusion

Sep 09, 2025
Via arXiv

Graph-Fused Vision-Language-Action for Policy Reasoning in Multi-Arm Robotic Manipulation

Sep 09, 2025
Via arXiv

MonoMPC: Monocular Vision Based Navigation with Learned Collision Model and Risk-Aware Model Predictive Control

Aug 10, 2025
Via arXiv

Information-Theoretic Graph Fusion with Vision-Language-Action Model for Policy Reasoning and Dual Robotic Control

Aug 07, 2025
Via arXiv

UAVD-Mamba: Deformable Token Fusion Vision Mamba for Multimodal UAV Detection

Jul 01, 2025
Via arXiv