Picture for Thomas Brox

Thomas Brox

University of Freiburg

Simple LLM Baselines are Competitive for Model Diffing

Add code
Feb 10, 2026
Viaarxiv icon

On Geometric Understanding and Learned Data Priors in VGGT

Add code
Dec 12, 2025
Viaarxiv icon

Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models

Add code
Jul 17, 2025
Viaarxiv icon

CLIP Won't Learn Object-Attribute Binding from Natural Data and Here is Why

Add code
Jul 10, 2025
Viaarxiv icon

cVLA: Towards Efficient Camera-Space VLAs

Add code
Jul 02, 2025
Viaarxiv icon

Using Knowledge Graphs to harvest datasets for efficient CLIP model training

Add code
May 05, 2025
Viaarxiv icon

Detect, Classify, Act: Categorizing Industrial Anomalies with Multi-Modal Large Language Models

Add code
May 05, 2025
Viaarxiv icon

Label-Efficient LiDAR Semantic Segmentation with 2D-3D Vision Transformer Adapters

Add code
Mar 05, 2025
Viaarxiv icon

When and How Does CLIP Enable Domain and Compositional Generalization?

Add code
Feb 13, 2025
Viaarxiv icon

sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views

Add code
Feb 06, 2025
Figure 1 for sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
Figure 2 for sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
Figure 3 for sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
Figure 4 for sshELF: Single-Shot Hierarchical Extrapolation of Latent Features for 3D Reconstruction from Sparse-Views
Viaarxiv icon