Hongyuan Zhu

PointCloud-Text Matching: Benchmark Datasets and a Baseline

Mar 28, 2024

Contributing Dimension Structure of Deep Feature for Coreset Selection

Jan 29, 2024

Direct Distillation between Different Domains

Jan 12, 2024

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Dec 17, 2023

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Nov 30, 2023

Exploit the antenna response consistency to define the alignment criteria for CSI data

Oct 10, 2023

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Sep 17, 2023

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Sep 06, 2023

Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study

Jul 19, 2023

An Overview of Challenges in Egocentric Text-Video Retrieval

Jun 07, 2023