Picture for He Zhu

He Zhu

4DNeX: Feed-Forward 4D Generative Modeling Made Easy

Add code
Aug 18, 2025
Viaarxiv icon

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

Add code
Jul 08, 2025
Viaarxiv icon

TAG-INSTRUCT: Controlled Instruction Complexity Enhancement through Structure-based Augmentation

Add code
May 24, 2025
Viaarxiv icon

PlanGPT-VL: Enhancing Urban Planning with Domain-Specific Vision-Language Models

Add code
May 21, 2025
Viaarxiv icon

Retrieval-augmented in-context learning for multimodal large language models in disease classification

Add code
May 04, 2025
Viaarxiv icon

FMLGS: Fast Multilevel Language Embedded Gaussians for Part-level Interactive Agents

Add code
Apr 11, 2025
Viaarxiv icon

Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions

Add code
Apr 07, 2025
Figure 1 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 2 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 3 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Figure 4 for Grounding 3D Object Affordance with Language Instructions, Visual Observations and Interactions
Viaarxiv icon

Multi-view Reconstruction via SfM-guided Monocular Depth Estimation

Add code
Mar 18, 2025
Viaarxiv icon

Structural Entropy Guided Unsupervised Graph Out-Of-Distribution Detection

Add code
Mar 05, 2025
Viaarxiv icon

LayAlign: Enhancing Multilingual Reasoning in Large Language Models via Layer-Wise Adaptive Fusion and Alignment Strategy

Add code
Feb 17, 2025
Viaarxiv icon