Picture for Na Zhao

Na Zhao

CCF: Complementary Collaborative Fusion for Domain Generalized Multi-Modal 3D Object Detection

Add code
Mar 24, 2026
Viaarxiv icon

VGGT-360: Geometry-Consistent Zero-Shot Panoramic Depth Estimation

Add code
Mar 19, 2026
Viaarxiv icon

Words at Play: Benchmarking Audio Pun Understanding in Large Audio-Language Models

Add code
Mar 19, 2026
Viaarxiv icon

Robust Depth Super-Resolution via Adaptive Diffusion Sampling

Add code
Feb 10, 2026
Viaarxiv icon

Graph Smoothing for Enhanced Local Geometry Learning in Point Cloud Analysis

Add code
Jan 16, 2026
Viaarxiv icon

RaLiFlow: Scene Flow Estimation with 4D Radar and LiDAR Point Clouds

Add code
Dec 11, 2025
Viaarxiv icon

AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models

Add code
Nov 13, 2025
Figure 1 for AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models
Figure 2 for AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models
Figure 3 for AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models
Figure 4 for AffordBot: 3D Fine-grained Embodied Reasoning via Multimodal Large Language Models
Viaarxiv icon

LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors

Add code
Nov 10, 2025
Figure 1 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 2 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 3 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Figure 4 for LiteUpdate: A Lightweight Framework for Updating AI-Generated Image Detectors
Viaarxiv icon

An Integrated Framework of Prompt Engineering and Multidimensional Knowledge Graphs for Legal Dispute Analysis

Add code
Jul 10, 2025
Viaarxiv icon

How Do Images Align and Complement LiDAR? Towards a Harmonized Multi-modal 3D Panoptic Segmentation

Add code
May 25, 2025
Viaarxiv icon