Philipp Krähenbühl

Compressed Map Priors for 3D Perception

Dec 31, 2025
Via arXiv

Spherical Leech Quantization for Visual Tokenization and Generation

Dec 16, 2025
Via arXiv

Interactive Post-Training for Vision-Language-Action Models

May 22, 2025
Via arXiv

PerceptionLM: Open-Access Data and Models for Detailed Visual Understanding

Apr 17, 2025
Via arXiv

QLIP: Text-Aligned Visual Tokenization Unifies Auto-Regressive Multimodal Understanding and Generation

Feb 07, 2025
Via arXiv

Robust Autonomy Emerges from Self-Play

Feb 05, 2025
Via arXiv

Reinforcement Learning for Long-Horizon Interactive LLM Agents

Feb 04, 2025
Via arXiv

Cut Your Losses in Large-Vocabulary Language Models

Nov 13, 2024
Via arXiv

Promptable Closed-loop Traffic Simulation

Sep 09, 2024
Via arXiv

Image and Video Tokenization with Binary Spherical Quantization

Jun 11, 2024
Via arXiv