Huimin Lu

National University of Defense Technology

DCP-CLIP: A Coarse-to-Fine Framework for Open-Vocabulary Semantic Segmentation with Dual Interaction

Mar 14, 2026
Via arXiv

IGASA: Integrated Geometry-Aware and Skip-Attention Modules for Enhanced Point Cloud Registration

Mar 13, 2026
Via arXiv

GeoLoco: Leveraging 3D Geometric Priors from Visual Foundation Model for Robust RGB-Only Humanoid Locomotion

Mar 08, 2026
Via arXiv

Latent-Space Autoregressive World Model for Efficient and Robust Image-Goal Navigation

Nov 14, 2025
Via arXiv

Dual-Arm Hierarchical Planning for Laboratory Automation: Vibratory Sieve Shaker Operations

Sep 18, 2025
Via arXiv

Grasp Like Humans: Learning Generalizable Multi-Fingered Grasping from Human Proprioceptive Sensorimotor Integration

Sep 10, 2025
Via arXiv

Probabilistic Temporal Masked Attention for Cross-view Online Action Detection

Aug 23, 2025
Via arXiv

SurfAAV: Design and Implementation of a Novel Multimodal Surfing Aquatic-Aerial Vehicle

Jun 18, 2025
Via arXiv

UniDetox: Universal Detoxification of Large Language Models via Dataset Distillation

Apr 29, 2025
Via arXiv

UGNA-VPR: A Novel Training Paradigm for Visual Place Recognition Based on Uncertainty-Guided NeRF Augmentation

Mar 27, 2025
Via arXiv