Picture for Shuqiang Jiang

Shuqiang Jiang

GA-VLN: Geometry-Aware BEV Representation for Efficient Vision-Language Navigation

Add code
May 21, 2026
Viaarxiv icon

TrajRAG: Retrieving Geometric-Semantic Experience for Zero-Shot Object Navigation

Add code
May 03, 2026
Viaarxiv icon

Multi-Scale Gaussian-Language Map for Zero-shot Embodied Navigation and Reasoning

Add code
May 03, 2026
Viaarxiv icon

OmniFood8K: Single-Image Nutrition Estimation via Hierarchical Frequency-Aligned Fusion

Add code
Apr 14, 2026
Viaarxiv icon

Embodied Arena: A Comprehensive, Unified, and Evolving Evaluation Platform for Embodied AI

Add code
Sep 18, 2025
Viaarxiv icon

Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation

Add code
Jun 14, 2024
Figure 1 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 2 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 3 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Figure 4 for Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation
Viaarxiv icon

FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination

Add code
Jun 11, 2024
Figure 1 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 2 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 3 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Figure 4 for FoodSky: A Food-oriented Large Language Model that Passes the Chef and Dietetic Examination
Viaarxiv icon

DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model

Add code
May 12, 2024
Figure 1 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 2 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 3 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Figure 4 for DiffGen: Robot Demonstration Generation via Differentiable Physics Simulation, Differentiable Rendering, and Vision-Language Model
Viaarxiv icon

Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation

Add code
Apr 02, 2024
Figure 1 for Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Figure 2 for Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Figure 3 for Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Figure 4 for Lookahead Exploration with Neural Radiance Representation for Continuous Vision-Language Navigation
Viaarxiv icon

Synthesizing Knowledge-enhanced Features for Real-world Zero-shot Food Detection

Add code
Feb 14, 2024
Viaarxiv icon