Fei Xia

Google DeepMind

Global-Local Collaborative Inference with LLM for Lidar-Based Open-Vocabulary Detection

Jul 12, 2024
Via arXiv

Mobility VLA: Multimodal Instruction Navigation with Long-Context VLMs and Topological Graphs

Jul 10, 2024
Via arXiv

Find Parent then Label Children: A Two-stage Taxonomy Completion Method with Pre-trained Language Model

Jun 25, 2024
Via arXiv

VADER: Visual Affordance Detection and Error Recovery for Multi Robot Human Collaboration

May 25, 2024
Via arXiv

GenCHiP: Generating Robot Policy Code for High-Precision and Contact-Rich Manipulation Tasks

Apr 09, 2024
Via arXiv

Extracting Social Determinants of Health from Pediatric Patient Notes Using Large Language Models: Novel Corpus and Methods

Apr 04, 2024
Via arXiv

CoNVOI: Context-aware Navigation using Vision Language Models in Outdoor and Indoor Environments

Mar 22, 2024
Via arXiv

MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections

Mar 16, 2024
Via arXiv

BEHAVIOR-1K: A Human-Centered, Embodied AI Benchmark with 1,000 Everyday Activities and Realistic Simulation

Mar 14, 2024
Via arXiv

Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context

Mar 08, 2024
Via arXiv