Hongyuan Zhu

PointCloud-Text Matching: Benchmark Datasets and a Baseline

Mar 28, 2024

Contributing Dimension Structure of Deep Feature for Coreset Selection

Jan 29, 2024

Direct Distillation between Different Domains

Jan 12, 2024

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

Dec 17, 2023

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Nov 30, 2023

Exploit the antenna response consistency to define the alignment criteria for CSI data

Oct 10, 2023

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Sep 17, 2023

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Sep 06, 2023

Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study

Jul 19, 2023

An Overview of Challenges in Egocentric Text-Video Retrieval

Jun 07, 2023