Picture for Hang Yin

Hang Yin

Real-Time Iteration Scheme for Diffusion Policy

Add code
Aug 07, 2025
Viaarxiv icon

IGL-Nav: Incremental 3D Gaussian Localization for Image-goal Navigation

Add code
Aug 01, 2025
Viaarxiv icon

Air Quality Prediction with A Meteorology-Guided Modality-Decoupled Spatio-Temporal Network

Add code
Apr 14, 2025
Viaarxiv icon

UniGoal: Towards Universal Zero-shot Goal-oriented Navigation

Add code
Mar 13, 2025
Viaarxiv icon

BEHAVIOR Robot Suite: Streamlining Real-World Whole-Body Manipulation for Everyday Household Activities

Add code
Mar 07, 2025
Viaarxiv icon

External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation

Add code
Feb 26, 2025
Figure 1 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 2 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 3 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Figure 4 for External Large Foundation Model: How to Efficiently Serve Trillions of Parameters for Online Ads Recommendation
Viaarxiv icon

Generative Video Semantic Communication via Multimodal Semantic Fusion with Large Model

Add code
Feb 19, 2025
Viaarxiv icon

Top Ten Challenges Towards Agentic Neural Graph Databases

Add code
Jan 24, 2025
Viaarxiv icon

Enhancing Transformers for Generalizable First-Order Logical Entailment

Add code
Jan 01, 2025
Figure 1 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 2 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 3 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Figure 4 for Enhancing Transformers for Generalizable First-Order Logical Entailment
Viaarxiv icon

Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning

Add code
Dec 21, 2024
Figure 1 for Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning
Figure 2 for Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning
Figure 3 for Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning
Figure 4 for Do Multimodal Language Models Really Understand Direction? A Benchmark for Compass Direction Reasoning
Viaarxiv icon