Rui Huang

College of Computer Science and Technology, Civil Aviation University of China, China

Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening

Feb 07, 2025

DEFOM-Stereo: Depth Foundation Model Based Stereo Matching

Jan 16, 2025

Fine-grained Video-Text Retrieval: A New Benchmark and Method

Dec 31, 2024

CRM: Retrieval Model with Controllable Condition

Dec 18, 2024

Grasp What You Want: Embodied Dexterous Grasping System Driven by Your Voice

Dec 14, 2024

GTPC-SSCD: Gate-guided Two-level Perturbation Consistency-based Semi-Supervised Change Detection

Nov 28, 2024

Training an Open-Vocabulary Monocular 3D Object Detection Model without 3D Data

Nov 23, 2024

QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou

Nov 18, 2024

KuaiFormer: Transformer-Based Retrieval at Kuaishou

Nov 15, 2024

CFPNet: Improving Lightweight ToF Depth Completion via Cross-zone Feature Propagation

Nov 07, 2024