Hang Yin

Real-Time Iteration Scheme for Diffusion Policy

Aug 07, 2025
Via arXiv

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Aug 01, 2025
Via arXiv

Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network

Apr 14, 2025
Via arXiv

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Mar 13, 2025
Via arXiv

BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities

Mar 07, 2025
Via arXiv

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

Feb 26, 2025
Via arXiv

Generative Video Semantic Communication via Multimodal Semantic Fusion with Large Model

Feb 19, 2025
Via arXiv

Top Ten Challenges Towards Agentic Neural Graph Databases

Jan 24, 2025
Via arXiv

Enhancing Transformers for Generalizable First-Order Logical Entailment

Jan 01, 2025
Via arXiv

Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning

Dec 21, 2024
Via arXiv