Picture for Baoxiong Jia

Baoxiong Jia

Unifying 3D Vision-Language Understanding via Promptable Queries

Add code
May 19, 2024
Figure 1 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 2 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 3 for Unifying 3D Vision-Language Understanding via Promptable Queries
Figure 4 for Unifying 3D Vision-Language Understanding via Promptable Queries
Viaarxiv icon

Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

Add code
Apr 16, 2024
Figure 1 for Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V
Figure 2 for Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V
Figure 3 for Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V
Figure 4 for Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V
Viaarxiv icon

PhyScene: Physically Interactable 3D Scene Synthesis for Embodied AI

Add code
Apr 15, 2024
Viaarxiv icon

Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance

Add code
Mar 26, 2024
Figure 1 for Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Figure 2 for Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Figure 3 for Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Figure 4 for Move as You Say, Interact as You Can: Language-guided Human Motion Generation with Scene Affordance
Viaarxiv icon

SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding

Add code
Jan 17, 2024
Figure 1 for SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Figure 2 for SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Figure 3 for SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Figure 4 for SceneVerse: Scaling 3D Vision-Language Learning for Grounded Scene Understanding
Viaarxiv icon

An Embodied Generalist Agent in 3D World

Add code
Nov 18, 2023
Figure 1 for An Embodied Generalist Agent in 3D World
Figure 2 for An Embodied Generalist Agent in 3D World
Figure 3 for An Embodied Generalist Agent in 3D World
Figure 4 for An Embodied Generalist Agent in 3D World
Viaarxiv icon

ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab

Add code
Nov 01, 2023
Figure 1 for ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Figure 2 for ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Figure 3 for ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Figure 4 for ProBio: A Protocol-guided Multimodal Dataset for Molecular Biology Lab
Viaarxiv icon

X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events

Add code
Aug 21, 2023
Figure 1 for X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
Figure 2 for X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
Figure 3 for X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
Figure 4 for X-VoE: Measuring eXplanatory Violation of Expectation in Physical Events
Viaarxiv icon

ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes

Add code
Apr 09, 2023
Figure 1 for ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Figure 2 for ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Figure 3 for ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Figure 4 for ARNOLD: A Benchmark for Language-Grounded Task Learning With Continuous States in Realistic 3D Scenes
Viaarxiv icon

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

Add code
Jan 15, 2023
Figure 1 for Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Figure 2 for Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Figure 3 for Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Figure 4 for Diffusion-based Generation, Optimization, and Planning in 3D Scenes
Viaarxiv icon