Picture for Aniruddha Kembhavi

Aniruddha Kembhavi

PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators

Add code
Jun 28, 2024
Figure 1 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 2 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 3 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Figure 4 for PoliFormer: Scaling On-Policy RL with Transformers Results in Masterful Navigators
Viaarxiv icon

CodeNav: Beyond tool-use to using real-world codebases with LLM agents

Add code
Jun 18, 2024
Figure 1 for CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Figure 2 for CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Figure 3 for CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Figure 4 for CodeNav: Beyond tool-use to using real-world codebases with LLM agents
Viaarxiv icon

Task Me Anything

Add code
Jun 17, 2024
Figure 1 for Task Me Anything
Figure 2 for Task Me Anything
Figure 3 for Task Me Anything
Figure 4 for Task Me Anything
Viaarxiv icon

Preserving Identity with Variational Score for General-purpose 3D Editing

Add code
Jun 13, 2024
Figure 1 for Preserving Identity with Variational Score for General-purpose 3D Editing
Figure 2 for Preserving Identity with Variational Score for General-purpose 3D Editing
Figure 3 for Preserving Identity with Variational Score for General-purpose 3D Editing
Figure 4 for Preserving Identity with Variational Score for General-purpose 3D Editing
Viaarxiv icon

Iterated Learning Improves Compositionality in Large Vision-Language Models

Add code
Apr 02, 2024
Figure 1 for Iterated Learning Improves Compositionality in Large Vision-Language Models
Figure 2 for Iterated Learning Improves Compositionality in Large Vision-Language Models
Figure 3 for Iterated Learning Improves Compositionality in Large Vision-Language Models
Figure 4 for Iterated Learning Improves Compositionality in Large Vision-Language Models
Viaarxiv icon

Seeing the Unseen: Visual Common Sense for Semantic Placement

Add code
Jan 15, 2024
Figure 1 for Seeing the Unseen: Visual Common Sense for Semantic Placement
Figure 2 for Seeing the Unseen: Visual Common Sense for Semantic Placement
Figure 3 for Seeing the Unseen: Visual Common Sense for Semantic Placement
Figure 4 for Seeing the Unseen: Visual Common Sense for Semantic Placement
Viaarxiv icon

Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action

Add code
Dec 28, 2023
Figure 1 for Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Figure 2 for Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Figure 3 for Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Figure 4 for Unified-IO 2: Scaling Autoregressive Multimodal Models with Vision, Language, Audio, and Action
Viaarxiv icon

Holodeck: Language Guided Generation of 3D Embodied AI Environments

Add code
Dec 14, 2023
Viaarxiv icon

Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences

Add code
Dec 14, 2023
Figure 1 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 2 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 3 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Figure 4 for Promptable Behaviors: Personalizing Multi-Objective Rewards from Human Preferences
Viaarxiv icon

Harmonic Mobile Manipulation

Add code
Dec 11, 2023
Figure 1 for Harmonic Mobile Manipulation
Figure 2 for Harmonic Mobile Manipulation
Figure 3 for Harmonic Mobile Manipulation
Figure 4 for Harmonic Mobile Manipulation
Viaarxiv icon