Picture for Hongyuan Zhu

Hongyuan Zhu

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

Add code
Nov 30, 2023
Figure 1 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 2 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 3 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Figure 4 for LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
Viaarxiv icon

Exploit the antenna response consistency to define the alignment criteria for CSI data

Add code
Oct 10, 2023
Figure 1 for Exploit the antenna response consistency to define the alignment criteria for CSI data
Figure 2 for Exploit the antenna response consistency to define the alignment criteria for CSI data
Figure 3 for Exploit the antenna response consistency to define the alignment criteria for CSI data
Figure 4 for Exploit the antenna response consistency to define the alignment criteria for CSI data
Viaarxiv icon

Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention

Add code
Sep 17, 2023
Figure 1 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 2 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 3 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Figure 4 for Towards Debiasing Frame Length Bias in Text-Video Retrieval via Causal Intervention
Viaarxiv icon

Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning

Add code
Sep 06, 2023
Figure 1 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 2 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 3 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Figure 4 for Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning
Viaarxiv icon

Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study

Add code
Jul 19, 2023
Figure 1 for Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study
Figure 2 for Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study
Figure 3 for Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study
Figure 4 for Self-Supervised Learning for WiFi CSI-Based Human Activity Recognition: A Systematic Study
Viaarxiv icon

An Overview of Challenges in Egocentric Text-Video Retrieval

Add code
Jun 07, 2023
Figure 1 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 2 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 3 for An Overview of Challenges in Egocentric Text-Video Retrieval
Figure 4 for An Overview of Challenges in Egocentric Text-Video Retrieval
Viaarxiv icon

Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?

Add code
Apr 20, 2023
Figure 1 for Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
Figure 2 for Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
Figure 3 for Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
Figure 4 for Multi-view Vision-Prompt Fusion Network: Can 2D Pre-trained Model Boost 3D Point Cloud Data-scarce Learning?
Viaarxiv icon

What Makes for Effective Few-shot Point Cloud Classification?

Add code
Mar 31, 2023
Figure 1 for What Makes for Effective Few-shot Point Cloud Classification?
Figure 2 for What Makes for Effective Few-shot Point Cloud Classification?
Figure 3 for What Makes for Effective Few-shot Point Cloud Classification?
Figure 4 for What Makes for Effective Few-shot Point Cloud Classification?
Viaarxiv icon

A Closer Look at Few-Shot 3D Point Cloud Classification

Add code
Mar 31, 2023
Viaarxiv icon

End-to-End 3D Dense Captioning with Vote2Cap-DETR

Add code
Jan 06, 2023
Figure 1 for End-to-End 3D Dense Captioning with Vote2Cap-DETR
Figure 2 for End-to-End 3D Dense Captioning with Vote2Cap-DETR
Figure 3 for End-to-End 3D Dense Captioning with Vote2Cap-DETR
Figure 4 for End-to-End 3D Dense Captioning with Vote2Cap-DETR
Viaarxiv icon